WebSPHINX crawler - output alteration


#1
bemaitea

Hello all!

I've managed to snag a crawler to search a list of sites for a particular set of data on each page.

The WebSPHINX crawler works beautifully, and I've managed to customize it so that it searches each URL for a specific tag, extracts the information within the tags, and outputs it to an HTML file.

I was wondering if anyone could point me to the file that dictates this output format so I can change it. I've looked through the Jar file and have no idea what's what. If someone could take a peek I'd really appreciate it! :)
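
For reference, the extract-and-write step described above amounts to something like the minimal sketch below. It is plain Java using only the standard library, not WebSPHINX's own API or the code inside the Jar; the class name `TagExtractor`, the methods `extract` and `toHtml`, and the `title` tag are all hypothetical, chosen only to show where an output format of this kind would typically be defined.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical illustration only: none of these names come from WebSPHINX.
public class TagExtractor {

    // Collect the text found between <tag ...> and </tag> in the given HTML.
    static List<String> extract(String html, String tag) {
        Pattern p = Pattern.compile(
            "<" + Pattern.quote(tag) + "\\b[^>]*>(.*?)</" + Pattern.quote(tag) + ">",
            Pattern.CASE_INSENSITIVE | Pattern.DOTALL);
        Matcher m = p.matcher(html);
        List<String> out = new ArrayList<>();
        while (m.find()) {
            out.add(m.group(1).trim());
        }
        return out;
    }

    // The output format lives here: changing this method changes the report.
    // Values are inserted as-is, without HTML escaping.
    static String toHtml(List<String> values) {
        StringBuilder sb = new StringBuilder("<html><body><ul>\n");
        for (String v : values) {
            sb.append("<li>").append(v).append("</li>\n");
        }
        return sb.append("</ul></body></html>\n").toString();
    }

    public static void main(String[] args) {
        String page = "<html><title>First</title><p>x</p><TITLE>Second</TITLE></html>";
        System.out.print(toHtml(extract(page, "title")));
    }
}
```

In a layout like this, the method that builds the HTML string is the only place the output format is decided, so the equivalent spot in the crawler's source is what would need editing.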


This is the Jar file!

Concatenation



