
Accessing WWW in C++ help

3 replies to this topic

#1
atruepretense

    Newbie

  • Members
  • Pip
  • 7 posts
Hello all,

Well, to begin: I have become very interested in AI, so much so that I want to start programming some models and possibly get something working well.

Here's my issue. I don't have much experience writing C++ code that fetches information from a website. I was hoping to create a bot that would pull information from the net at certain spots. I know I can make it open an Internet Explorer page, but I'd like the fetching to be integrated into the program.

This bot will simply give the "AI" starting points. From there the AI will ask the user some questions. None of the questions will give direct info; they will only guide the "AI" in learning the correct info.

How do you think I can get started? I want the bot to pull information from the web and save it to a file that the AI can access. Please bear with me if my questions are not totally clear. At this moment, I am still brainstorming.

#2
theonejb

    Learning Programmer

  • Members
  • PipPipPip
  • 52 posts
Try using the cURL library (libcurl). It's very easy to use and will probably be the best starting point if you just want to fetch web pages for simple parsing.

#3
speculatius

    Newbie

  • Members
  • PipPip
  • 25 posts
Hello,

I don't know about C++ tools for information retrieval, but there is a really great engine for Java called Nutch. It has everything you need: a crawler, various indexers and filters, and you can even run it on large clusters. IMHO there is no better engine for this kind of thing.

#4
WingedPanther

    A spammer's worst nightmare

  • Moderators
  • 16,831 posts
The Boost libraries (Boost.Asio in particular) have a lot of additional functionality for writing networked programs. This will allow you to write a C++ program that does what you want in a cross-platform way. Curl is an option if you're in a *nix environment, but your reference to IE makes me think you're on Windows using Visual Studio.