Jump to content

Any Way to use MS Word's Ability to deal with PDFs? Convert PDFs?

- - - - -

  • Please log in to reply
2 replies to this topic

#1
ibad

ibad

    Learning Programmer

  • Members
  • PipPipPip
  • 39 posts
Hi,

I need to convert a pdf document to something else: either HTML or RTF.

Ofcourse you can copy text including formatting and tables from a pdf using Acrobat and paste it into Word or some other editor. Word also lets you save a docx file as a pdf. So I was thinking that maybe Word has some capability to convert pdfs to doc or something else?

If not, does Acrobat convert the text to some formatted "standard" format when we Copy?

Most of the tools I have worked with have flaws and bugs. None are able to convert a pdf to HTML or RTF accurately in 100% of cases. Some have trouble with underlines, others have trouble with bullets or line breaks in some documents (normally filings taken from the SEC's EDGAR service).

Is there any way for another application to call or use MS Word or Acrobat's ability to export to/from pdf? Are there any accurate converters that you guys know of?

Also.... is pdf even remotely human readable? I looked at some pdf files in notepad and it does not seem so. :(

Any help would be greatly appreciated!

#2
WingedPanther

WingedPanther

    A spammer's worst nightmare

  • Moderators
  • 16,831 posts
  • Location:Upstate, South Carolina
  • Programming Language:C, C++, PL/SQL, Delphi/Object Pascal, Pascal, Transact-SQL, Others
  • Learning:Java, C#, PHP, JavaScript, Lisp, Fortran, Haskell, Others
If the document isn't protected, OpenOffice has a PDF import plugin that may be useful.
Programming is a branch of mathematics.
My CodeCall Blog | My Personal Blog

#3
ibad

ibad

    Learning Programmer

  • Members
  • PipPipPip
  • 39 posts

WingedPanther said:

If the document isn't protected, OpenOffice has a PDF import plugin that may be useful.

Can the plugin be accessed from another application? What format does it import to? I need something that can be automatically accessed by code from another application.




1 user(s) are reading this topic

0 members, 1 guests, 0 anonymous users