Closed Thread
Page 1 of 2 12 LastLast
Results 1 to 10 of 12

Thread: Indexing files on an apache server.

  1. #1
    hodge-podge is offline Learning Programmer
    Join Date
    Jun 2009
    Location
    New Hampshire.
    Posts
    47
    Rep Power
    0

    Indexing files on an apache server.

    Alright, here's my problem. The Library of Congress has an atrocious site. Navigating the ui is painful enough, but finding pictures is even harder. As of now I've resorted to going directly through the
    Apache Index pages to look at each photo. For each photo however there is a .gif copy, and a .tif copy along with the standard jpg copy. For some reason apache doesn't have the option to arrange the files by file type, and sorting through these pictures is almost as annoying using the main site. So what Im hoping to do is write an program to scrape the html of the the page and arrange the links by file type.

    And this is what I'm talking about when I say index page...

    http://memory.loc.gov/service/pnp/cp...41000/3b41500/

    So here's my question:

    What language would be best to accomplish this?
    And, is this not merely a simple task, and I'd be better off dealing with it.
    If that is the case, does anyone know of software that can do this?

  2. CODECALL Circuit advertisement
    Join Date
    Always
    Location
    Advertising world
    Posts
    Many

     
  3. #2
    Join Date
    Oct 2007
    Location
    /dev/null
    Posts
    4,513
    Blog Entries
    8
    Rep Power
    59

    Re: Indexing files on an apache server.

    Funny you should ask - Last year I actually did something incredibly similar parsing the HTML of the LOC site for ISBN information. I can give you the code and you can modify it, if you like. What operating system do you want this for?
    sudo rm -rf /

  4. #3
    hodge-podge is offline Learning Programmer
    Join Date
    Jun 2009
    Location
    New Hampshire.
    Posts
    47
    Rep Power
    0

    Re: Indexing files on an apache server.

    That'd be awesome. When you say operating system, I'm assuming you are asking what os I'm using. I am using windows....

  5. #4
    Join Date
    Oct 2007
    Location
    /dev/null
    Posts
    4,513
    Blog Entries
    8
    Rep Power
    59

    Re: Indexing files on an apache server.

    Perfect. Here you go. Mind you, it's multithreaded, so be careful how many threads you use. I used 64 and crashed the LOC server. Once I stopped my program the site was back up.

    I am not responsible if things go wrong.
    sudo rm -rf /

  6. #5
    hodge-podge is offline Learning Programmer
    Join Date
    Jun 2009
    Location
    New Hampshire.
    Posts
    47
    Rep Power
    0

    Re: Indexing files on an apache server.

    Haha, really you crashed it? How would it do that? Simply to many requests? And thanks for this, I really appreciate it.

  7. #6
    Join Date
    Oct 2007
    Location
    /dev/null
    Posts
    4,513
    Blog Entries
    8
    Rep Power
    59

    Re: Indexing files on an apache server.

    Apparently it can't handle 64 requests at the same time. Basically an unintentional DoS attack. I scaled it back to 32 and I think that worked.
    sudo rm -rf /

  8. #7
    Join Date
    Jul 2006
    Posts
    16,491
    Blog Entries
    75
    Rep Power
    143

    Re: Indexing files on an apache server.

    I remember the first time I heard about that. Somehow, I can actually believe that. A ddos from a single computer in a campus dorm
    Programming is a branch of mathematics.
    My CodeCall Blog | My Personal Blog

  9. #8
    Join Date
    Oct 2007
    Location
    /dev/null
    Posts
    4,513
    Blog Entries
    8
    Rep Power
    59

    Re: Indexing files on an apache server.

    Amazing what people can do with their free time. Just to be an idiot I sent two friends an email from another friend (we're all close, so it's okay). It had...some rather...um... I'll just leave it at "there were goats involved." Anyway, I spoofed the headers to make it seem like it came from my victim friend. The others eventually figured it out and came to my room at three in the morning. When I answered the door, they attacked me with a large vibrating dildo. Apparently they found it at a frat house.
    sudo rm -rf /

  10. #9
    chirag.jain18 is offline Newbie
    Join Date
    Oct 2009
    Posts
    10
    Rep Power
    0

    Re: Indexing files on an apache server.

    Quote Originally Posted by dargueta View Post
    I spoofed the headers to make it seem like it came from my victim friend.
    Hey,what email server you were using? How did you chang the headers? I am interested to know how it is done.Please guide me.

  11. #10
    Join Date
    Oct 2007
    Location
    /dev/null
    Posts
    4,513
    Blog Entries
    8
    Rep Power
    59

    Re: Indexing files on an apache server.

    Pretty much any computer connected to a network with a program like sendmail installed can pull it off.
    Last edited by dargueta; 03-10-2010 at 12:30 AM. Reason: Added link
    sudo rm -rf /

Closed Thread
Page 1 of 2 12 LastLast

Thread Information

Users Browsing this Thread

There are currently 1 users browsing this thread. (0 members and 1 guests)

Similar Threads

  1. Apache Server Ubuntu
    By VakhoQ in forum PHP Development
    Replies: 3
    Last Post: 12-11-2010, 04:41 AM
  2. .htaccess sometimes disappearing at apache server?
    By onething in forum Linux Networking
    Replies: 2
    Last Post: 12-08-2010, 04:33 AM
  3. Replies: 5
    Last Post: 12-06-2010, 07:22 PM
  4. Free service to block online visitors by country in Apache Web server
    By mialye in forum Services for Buy/Sell/Trade
    Replies: 0
    Last Post: 09-30-2009, 10:29 PM

Tags for this Thread

Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts