Lost Password?


Go Back   CodeCall Programming Forum > Web Development Forum > PHP Forum

PHP Forum Use this forum to discuss all aspects of PHP Development. PHP is a server-side, cross-platform, HTML embedded scripting language that lets you create dynamic web pages.

Reply
 
LinkBack Thread Tools Search this Thread Display Modes
  #1 (permalink)  
Old 10-11-2007, 01:53 PM
burningdowntrees burningdowntrees is offline
Newbie
 
Join Date: Oct 2007
Posts: 3
Rep Power: 0
burningdowntrees is on a distinguished road
Default Looking for guidance on text-parsing project

Hi,

I'm interested in creating a program that essentially text parses format-consistent forms. I want the program to be able to pull data from these forms and database it.

I plan on hiring a programmer but because of my limited knowledge on this topic I dont really know where to look.

So my questions are:

1. How feasible is such a program?

2. Whats the best language to use for this particular project?

I created a posting in the classifieds section regarding this job, so if anyone is interested please reference that posting.

Thanks in advance!
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote

Sponsored Links
  #2 (permalink)  
Old 10-11-2007, 03:36 PM
John's Avatar   
John John is offline
Co-Administrator
 
Join Date: Jul 2006
Age: 20
Posts: 3,433
Last Blog:
Google Web Toolkit
Rep Power: 20
John has a reputation beyond reputeJohn has a reputation beyond reputeJohn has a reputation beyond reputeJohn has a reputation beyond reputeJohn has a reputation beyond reputeJohn has a reputation beyond reputeJohn has a reputation beyond reputeJohn has a reputation beyond reputeJohn has a reputation beyond reputeJohn has a reputation beyond reputeJohn has a reputation beyond repute
Send a message via AIM to John Send a message via MSN to John
Default

Actually, for the past several months I've been working on a program for a game I play. Essentially all statistics for a player are stored on a remote site in which I don't have access to their database directly. So I had to create, what I like to call an "engine," to connect to the site, parse the statistics, and store those stats in a local database. That way I can perform manipulations on them as I wished.

I've searched and looked into other languages, but PHP works just as well as any other language.

With that said, a few notes I've found that may help you are:

1) Parsing a remote site depends on the sites content. If the remote site updates their content, changes some text, rearranges their site, you have to make changes to your parsing engine.

2) Since you are connecting to a remote site, potentially many times in a short time period, you want to make sure your server doesn't get banned from their server. As they may mistake several connections for a ddos attack. Therefor, it might be a good idea to use multiple proxy's that can use to connect to the site.

3) Depending on how much content you need to parse, the load itself can use many of your server's resources. My script originally ran every twenty-four hours, and parsed up to one hundred pages. This caused a significant load on the server, so with some minor tweaks, making the script run every hour and parse four pages improved the engine's performance.
__________________
CodeCall Blog | CodeCall Wiki | Shareware | Linux Forum | My Blog
Chat with other CodeCall members on IRC; connect to irc.codecall.net and join #codecall
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
Reply



Currently Active Users Viewing This Thread: 1 (0 members and 1 guests)
 
Thread Tools Search this Thread
Search this Thread:

Advanced Search
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On
Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
A cool project idea. AlexanderRybak Community Projects 3 02-11-2008 03:37 AM
How to style fonts of a text in a simple page? c0de Tutorials 3 09-15-2007 11:08 PM
Usage of array structures to increment letter instances of text Yuriy M C and C++ 2 09-13-2007 11:49 AM
HTML Basic Formatting clookid Tutorials 14 03-06-2007 04:10 PM
Generate text with transparent background AfTriX PHP Tutorials 1 01-08-2007 03:13 AM


All times are GMT -5. The time now is 06:51 AM.

Contest Stats

WingedPanther ........ 2753.6
Xav ........ 2704
Brandon W ........ 1702.32
John ........ 1207.73
marwex89 ........ 1175.24
morefood2001 ........ 966.05
dcs ........ 655.75
Steve.L ........ 475.59
orjan ........ 418.58
Aereshaa ........ 383.54

Contest Rules

CodeCall Goal

Goal: 100,000 Posts
Complete: 97%

Ads