Lost Password?


Go Back   CodeCall Programming Forum > Software Development > General Programming

General Programming Non language specific, Assembly, Linux/Unix, Mac and anything not covered in other topics. Talk about Programming Theory here.

Reply
 
LinkBack Thread Tools Search this Thread Display Modes
  #1 (permalink)  
Old 02-15-2008, 10:23 PM
Amber8's Avatar   
Amber8 Amber8 is offline
Newbie
 
Join Date: Feb 2008
Posts: 5
Rep Power: 0
Amber8 is on a distinguished road
Default Automating Text Recognition

Hi everybody, I need to convert a bunch of pdf's to text searchable. The acrobat OCR function cant do it because the resolution is lower than the minimum required (144dpi). What I started doing is saving the pdf pages as image files, increasing the resolution in an imaging package then printing them again to pdf & doing the OCR. Obviously very repetitive & boring - i can think of much better things to do on Sat night LOL.
I was thinking of writing a script for it (using python since thats the only one I've played with in the past) but I was wondering if there exists already some piece of code to do this. I imagine its a common problem since there is a fair bit on the web talking about it but havent been able to find automated code to do it. Or if anyone has any ideas if any other language might be a better match for this??
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote

Sponsored Links
  #2 (permalink)  
Old 02-17-2008, 09:09 PM
azer24 azer24 is offline
Newbie
 
Join Date: Feb 2008
Posts: 1
Rep Power: 0
azer24 is on a distinguished road
Default

Hi Amber8,

Have you tried using Advanced PDF Manager
Code wise I have seen PDF Indexer for Joomla in PHP.
Or something like PDF to Word converter may work.
Sorry I can't post links yet but hope this helps!!!
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
Reply



Currently Active Users Viewing This Thread: 1 (0 members and 1 guests)
 
Thread Tools Search this Thread
Search this Thread:

Advanced Search
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On
Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Text file manipulation. joe1986 Python 11 01-28-2008 01:10 AM
How to save the text in a text file ??? kresh7 Visual Basic Programming 0 11-25-2007 11:30 AM
How to style fonts of a text in a simple page? c0de Tutorials 3 09-15-2007 11:08 PM
HTML Basic Formatting clookid Tutorials 14 03-06-2007 04:10 PM
Generate text with transparent background AfTriX PHP Tutorials 1 01-08-2007 03:13 AM


All times are GMT -5. The time now is 05:57 PM.

Contest Stats

WingedPanther ........ 2753.6
Xav ........ 2704
Brandon W ........ 1702.32
John ........ 1207.73
marwex89 ........ 1175.24
morefood2001 ........ 966.05
dcs ........ 655.75
Steve.L ........ 475.59
orjan ........ 418.58
Aereshaa ........ 383.54

Contest Rules

CodeCall Goal

Goal: 100,000 Posts
Complete: 98%

Ads