At the prime age of 49 I've decided to start attending night school to get some basic/advance knowledge of programming.
I want to make a web spider for my website but have no idea where to start will anyone be able to help ??
My structure that I think will work will be as follows -
index page.php - just mock up a site home page and put an 'include' for a search bar uptop. Make sure its .php otherwise it wont work
Results page - for the search results.
Spider - for searching through the pages. << (PROBLEM)
Database - for storing all the data collected by the spider.
Any help will be much appreciated, I've exhausted all other possible avenues and have reached a dead end...
Web Spider....
Started by OldMac, Jul 17 2009 02:53 PM
19 replies to this topic
#1
Posted 17 July 2009 - 02:53 PM
|
|
|
#2
Posted 17 July 2009 - 02:59 PM
I'm not really sure what you are trying to do. Crawl your own website?
#3
Posted 17 July 2009 - 03:10 PM
Yeah just the information on my own web pages.
So I think I want to try and get stuff on my web pages put on mySQL database and then the spider searches that ???
So I think I want to try and get stuff on my web pages put on mySQL database and then the spider searches that ???
#4
Posted 17 July 2009 - 06:04 PM
Making a spider for that would be a waste of time. You can use a MySQL query to achive the same results with much less effort. Spidrs are better for sites that you do not have access to there database.
#5
Guest_Jordan_*
Posted 17 July 2009 - 06:48 PM
Guest_Jordan_*
How is your site built, statically (every page hand written) or dynamically such as this forum (each page driven by database results)? Depending on such, Zero's answer could be correct (#2, database driven).
Aside from that, building a spider isn't hard. You can use CURL to crawl pages and extract HTML data.
Aside from that, building a spider isn't hard. You can use CURL to crawl pages and extract HTML data.
#6
Posted 17 July 2009 - 06:52 PM
@ Jordon - he wants his site dynmic with MySQL database
OldMac said:
So I think I want to try and get stuff on my web pages put on mySQL database and then the spider searches that ???
#7
Guest_Jordan_*
Posted 17 July 2009 - 06:55 PM
Guest_Jordan_*
Oh, I suppose I should of read his second response/reply. /* dumbass */
#8
Posted 18 July 2009 - 01:01 AM
Learning MySQL isn't that difficult.
Although complex queries are a little bit on the hard side to think of in beginning stages.
You will pickup in no time.
Although complex queries are a little bit on the hard side to think of in beginning stages.
You will pickup in no time.
#9
Posted 18 July 2009 - 01:39 AM
I'm willing to go with whatever is the easiest option is, ideally though I would like to make the spider with PHP if that is at all possible.
I'm really at a last resort now so any help will be much appreciated.
P.S the webpages are going to be very basic with just a few pieces of text, headings and a few tables nothing fancy at all for the moment until I get the spider done.
I'm really at a last resort now so any help will be much appreciated.
P.S the webpages are going to be very basic with just a few pieces of text, headings and a few tables nothing fancy at all for the moment until I get the spider done.
#10
Posted 18 July 2009 - 02:54 AM
I've found this coding....
phpcodesnippets.com
But I want it to track inputted data opposed to URL what modification could I make from this coding or would I have to revamp all of it ?
phpcodesnippets.com
But I want it to track inputted data opposed to URL what modification could I make from this coding or would I have to revamp all of it ?
#11
Guest_Jaan_*
Posted 18 July 2009 - 05:47 AM
Guest_Jaan_*
Btw guys you gave me a good idea for a tutorial :D
#12
Posted 18 July 2009 - 06:46 AM
Any ideas guys ??
I've been searching the web and have seen maybe the term crawler would be more specific to what I need ?
I've been searching the web and have seen maybe the term crawler would be more specific to what I need ?


Sign In
Create Account

Back to top









