hackalive

Members
  • Posts

    652
  • Joined

  • Last visited

Everything posted by hackalive

  1. Thanks to ignace and MrAdam I have managed to complete my web crawler script for a search engine. Considering the scale this may grow to, I need a clean and efficient DB design, so I am asking how you guys would do it. It would need somewhere for all the keywords and be able to link them to certain sites. One table (tbl_site) might work like this: |id|url|title|description|. Just looking for your opinions on how you think this DB could/should work. Thanks in advance
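For what it's worth, a common shape for this is three tables: one for sites, one for keywords, and a join table linking them (many-to-many). A sketch with assumed table names, using an in-memory SQLite database via PDO so it runs standalone (a production crawler would presumably use MySQL or similar):

```php
<?php
// Hypothetical schema: tbl_site, tbl_keyword, and a join table
// tbl_site_keyword so one keyword can map to many sites and vice versa.
$db = new PDO('sqlite::memory:');
$db->setAttribute(PDO::ATTR_ERRMODE, PDO::ERRMODE_EXCEPTION);

$db->exec('CREATE TABLE tbl_site (
    id INTEGER PRIMARY KEY,
    url TEXT UNIQUE,
    title TEXT,
    description TEXT
)');
$db->exec('CREATE TABLE tbl_keyword (
    id INTEGER PRIMARY KEY,
    word TEXT UNIQUE
)');
// Many-to-many link between sites and keywords.
$db->exec('CREATE TABLE tbl_site_keyword (
    site_id INTEGER,
    keyword_id INTEGER,
    PRIMARY KEY (site_id, keyword_id)
)');

// Example: index one page under one keyword.
$db->exec("INSERT INTO tbl_site (url, title, description)
           VALUES ('http://example.com', 'Example', 'An example page')");
$db->exec("INSERT INTO tbl_keyword (word) VALUES ('example')");
$db->exec('INSERT INTO tbl_site_keyword VALUES (1, 1)');

// Search: find site URLs for a keyword.
$stmt = $db->prepare('SELECT s.url FROM tbl_site s
    JOIN tbl_site_keyword sk ON sk.site_id = s.id
    JOIN tbl_keyword k ON k.id = sk.keyword_id
    WHERE k.word = ?');
$stmt->execute(['example']);
$url = $stmt->fetchColumn();
echo $url; // http://example.com
```

The join table is the part that keeps it clean at scale: keywords are stored once each, and a lookup is a single indexed join rather than a LIKE scan over a text column.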
  2. PS thanks very very very much to all those who have helped me so much in the past, especially ignace who is my saviour, thanks a million. And now that this post is closing, thanks go to ignace and MrAdam
  3. Well okay thorpe, give me an email address and I'll send you the Beta when it's ready (of course, as with all OS virtualisation projects, this is probably a few months away). And I was warned about you by people on other forums and in person about being a ... well, frankly, a stuck-up smartass. At least people like ignace and MrAdam and many others offer advice and comments without the stuck-up smartass attitude; no wonder people leave this forum and never return.
  4. Don't listen to thorpe, anything is possible, it just depends how much work, time and effort you are willing to put in. Time and time again I hear on this and other forums "that's not possible because...", but guess what, most have ended up working (with loads of work and stress), and for the others I have some ideas on how they might work (but I don't have a lot of time to do it at the moment). So if you are determined, it will happen
  5. Well thorpe, my OS virtualisation project has started and is going well, so umm yeah. Just trying to clean this cURL thing out of my inbox (so to speak)
  6. <?php $ch = curl_init(); curl_setopt($ch, CURLOPT_USERAGENT, 'HackAliveCrawler http://mysite.com/hackalivecrawler'); ?> Yeah, so if I do the above then do the crawl using cURL, the User-Agent will be recorded as HackAliveCrawler, yes?
  7. okay thorpe so what do I set $ch as?
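$ch is just the handle that curl_init() returns; every subsequent curl_setopt() call applies to that handle. A minimal sketch putting the pieces together (the URLs are placeholders, and the actual network fetch is left commented out):

```php
<?php
// curl_init() returns the cURL handle; all options are set against it.
$ch = curl_init('http://example.com/');
$ok = curl_setopt_array($ch, [
    CURLOPT_USERAGENT      => 'HackAliveCrawler http://mysite.com/hackalivecrawler',
    CURLOPT_RETURNTRANSFER => true,  // return the page body instead of printing it
    CURLOPT_FOLLOWLOCATION => true,  // follow redirects
]);
// $html = curl_exec($ch);  // performs the request, sending the UA above
curl_close($ch);
```

With CURLOPT_USERAGENT set like this, the target server's access logs and stats packages will record the request under that User-Agent string.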
  8. MrAdam, for this code <?php curl_setopt($ch, CURLOPT_USERAGENT, 'HackAliveCrawler http://mysite.com/hackalivecrawler'); echo $_SERVER['HTTP_USER_AGENT']; echo "|"; echo $_SERVER['USER_AGENT']; ?>
  9. yeah, I am just doing it on a sample page at the moment with no cURL, just exactly what I have posted
  10. Okay, slight problem: this code <?php header('User-Agent: HackAliveCrawler http://mysite.com/hackalivecrawler'); echo $_SERVER['HTTP_USER_AGENT']; ?> does not return the User-Agent I set, any reason why? And if I use USER_AGENT instead of HTTP_USER_AGENT it returns nothing. thanks once again
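The reason this fails: header() sets the *response* headers your script sends back to the browser, while $_SERVER['HTTP_USER_AGENT'] reads the *request* header the client sent in. The two never meet, so the User-Agent has to be set on the client (cURL) side, not in the receiving script. A small illustration, run from the CLI where no browser request exists:

```php
<?php
// header() queues a header on the OUTGOING response; it never changes
// the INCOMING request headers that populate $_SERVER.
header('User-Agent: HackAliveCrawler http://mysite.com/hackalivecrawler');

// On the CLI there is no HTTP request at all, so no HTTP_USER_AGENT key
// exists regardless of what header() was called with:
$request_ua = $_SERVER['HTTP_USER_AGENT'] ?? '(no request UA)';
echo $request_ua; // (no request UA)
```

(USER_AGENT returns nothing for the same reason: PHP only ever exposes the client's header as HTTP_USER_AGENT, and only when a client actually sent one.)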
  11. Yes, with PHP. Thanks a million once again to ignace (who many a time has saved me from trawling the entire internet) and also to MrAdam, thanks.
  12. Because MLBot has this; it is what I want to achieve
  13. Okay, so I get that it is part of the request headers; how then can I set my own, like MLBot and Google Crawler do, for when it is executed via a cron job?
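It makes no difference whether the script is started by cron or by a browser: the crawler itself is the HTTP client, so the User-Agent belongs on each outgoing request it makes. With cURL that is CURLOPT_USERAGENT, as above; the same idea with PHP's stream wrappers (no cURL extension needed) looks like this:

```php
<?php
// A stream context carries the User-Agent for file_get_contents() etc.
$context = stream_context_create([
    'http' => [
        'user_agent' => 'HackAliveCrawler http://mysite.com/hackalivecrawler',
    ],
]);
// $html = file_get_contents('http://example.com/', false, $context);  // placeholder URL

// Confirm what the context will send:
$opts = stream_context_get_options($context);
echo $opts['http']['user_agent'];
```

Whether cron invokes `php crawler.php` or a browser hits the same script over HTTP, this client-side setting is what the crawled sites' stats will record.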
  14. Oh, by the way, it will be running as part of a cron job and sometimes via direct browser. Thought I might just add this as it may affect the User-Agent directive mentioned
  15. Okay, thanks once again ignace. I'm just unsure how to set the User-Agent directive, so if you know a good link for this or can tell me, it will be much appreciated. Thanks again ignace
  16. Okay, so thorpe has locked my post and pointed out it is far too vague. So here I go again. Firstly, I know how to build a basic PHP cURL web page crawler. Now my question is more specific: how do I get site stats to recognise the crawler, the way ones such as Google Bot and MLBot are listed (MLBot http://www.metadatalabs.com/mlbot, so mine would be HackAliveCrawler http://mysite.com/hackalivecrawler or similar)? And how do I get it to recognise ROBOTS meta tags, i.e. NOINDEX, NOFOLLOW, or INDEX, FOLLOW etc., and robots.txt? (To do this I need the first part to work, the name-setting part, e.g. GoogleBot or MLBot etc.) Hope this is less vague and can produce some answers. Links to tutorials or other forums that will achieve these desired results, as well as personal opinions and comments, are most welcome. Many thanks in advance.
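For reference, a minimal sketch of the second half of the question: the robots checks. The function names here are made up for illustration, and real robots.txt matching has more rules (wildcards, Allow precedence, longest-match); this only handles simple prefix Disallow lines and a basic ROBOTS meta tag:

```php
<?php
// Return true if $path is allowed for $agent by the robots.txt text.
// Simplified: prefix-only Disallow matching, no Allow/wildcard support.
function robots_txt_allows(string $robotsTxt, string $agent, string $path): bool {
    $applies = false;
    foreach (preg_split('/\R/', $robotsTxt) as $line) {
        $line = trim(preg_replace('/#.*/', '', $line)); // strip comments
        if (preg_match('/^User-agent:\s*(.+)$/i', $line, $m)) {
            $ua = trim($m[1]);
            $applies = ($ua === '*' || stripos($agent, $ua) !== false);
        } elseif ($applies && preg_match('/^Disallow:\s*(\S*)/i', $line, $m)) {
            if ($m[1] !== '' && strpos($path, $m[1]) === 0) {
                return false; // path falls under a Disallow prefix
            }
        }
    }
    return true;
}

// Return [index, follow] flags from a page's ROBOTS meta tag.
function meta_robots_flags(string $html): array {
    if (preg_match('/<meta\s+name=["\']robots["\']\s+content=["\']([^"\']+)/i', $html, $m)) {
        $c = strtolower($m[1]);
        return [strpos($c, 'noindex') === false, strpos($c, 'nofollow') === false];
    }
    return [true, true]; // no tag means index + follow by default
}

$robots = "User-agent: HackAliveCrawler\nDisallow: /private/";
var_dump(robots_txt_allows($robots, 'HackAliveCrawler', '/private/page.html')); // false
var_dump(meta_robots_flags('<meta name="robots" content="noindex, follow">'));  // [false, true]
```

The crawler would fetch /robots.txt once per host, call robots_txt_allows() before each fetch, and call meta_robots_flags() on each fetched page to decide whether to index it and whether to queue its links.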
  17. Hello, I am looking to build a web crawler such as MLBot. It must recognise robots.txt and the ROBOTS meta tag, and when a site such as WordPress shows visitor stats it should list the crawler (e.g. MLBot http://www.metadatalabs.com/mlbot). So how can I build a crawler that will be listed as HackAliveBot or HackAliveCrawler and will recognise robots.txt and the ROBOTS meta tag? Thanks so much in advance
  18. Maybe so people can see where this is headed, here is the first "tag" I want to implement: <ha:group gid="66666"></ha:group> It would be used like this: <ha:group gid="66666">Hello, you are part of group 66666<ha:else>you are NOT part of group 66666</ha:else></ha:group> So what I am after is how to make the XMLNS sheet that recognises and converts the tags (how to parse this, or any tags). All suggestions are welcome. I know I need an XML namespace (XMLNS) but need to know how to build up the XMLNS sheet etc. to achieve the above sample tag. Thanks guys in advance
  19. also if anyone knows another forum or site to post my question on please let me know, thanks
  20. Anyone know how I can implement the XML parser and XML namespace (XSLT template) to make this all work?
  21. Perfect, thanks so much ignace and Tazerenix, very very very much appreciated
  22. Okay, what you suggested ignace works except..... now all the theming etc. is gone (it has replaced <PL1> with the file, but all the surrounding stuff for <PL1>, which should stay, is also gone). I have: ob_start(); $A = str_replace('<A>', require_once($dir.'/'.$file), $A); $A = ob_get_contents(); return $A;
  23. @ignace, yes it is returning a 1. @Tazerenix, yes that PHP needs to be there; I need to achieve this somehow with the file structure I have going. So if anyone can think of a way to achieve this, let me know please, thanks in advance
  24. Tazerenix, I have just tried your suggestion and it doesn't handle the PHP
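For anyone hitting the same wall: require_once() returns the included file's *return value* (1 by default), not its printed output, which is why the str_replace() above splices a literal "1" into the template and why the thread sees "returning a 1". Capturing the include's output with output buffering first, then doing the replace, fixes it. A self-contained demo (the temp file and template stand in for the real theme files):

```php
<?php
// Create a stand-in include file that prints something when required.
$file = tempnam(sys_get_temp_dir(), 'pl');
file_put_contents($file, '<?php echo "rendered page"; ?>');

$template = '<div class="theme"><PL1></div>';

// Capture the include's OUTPUT rather than its return value.
ob_start();
require $file;                 // its echo goes into the buffer
$content = ob_get_clean();     // "rendered page"

// Now splice the captured output into the template; the surrounding
// theme markup around <PL1> is preserved.
$page = str_replace('<PL1>', $content, $template);
echo $page; // <div class="theme">rendered page</div>

unlink($file);
```

ob_get_clean() both returns the buffer and ends buffering, so the template markup around the placeholder is never swallowed by a still-open buffer.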