Jump to content

need thr pros help on making a pdf convertor

- - - - -

This topic has been archived. This means that you cannot reply to this topic.
5 replies to this topic

#1
Master

Master

    Newbie

  • Members
  • PipPip
  • 13 posts
hi guys , its been a while since i started to to think about making a converter for pdf files ... and now after all , i am really serious about it ..and wana make it ar all cost ...
now i have some questions for you .:
first : how should i start my project ..i mean first of all i need to read a pdf files and get the data ... the first problem is here .. what kind of command in C should i use to get the infos ? do i just have to read char by char and then save them all those in a document ? !! and is this supposed to be right ? ( i dont think so )
i know that the pdf has a structure .. how can i find out a bit more about this so that i can retrieve the information and then do the right job ...

second : i am going to use this pdf converter to convert pdf files (which are also in farsi ) to the *.doc format ...
with the current converters , after conversion everything just gets in a reversed order .. i mean if you write "سلام " after conversion it would be sth like this "م ا ل س " so how to fix this problem too ?

(in farsi language you write from write to left)

i hope you do me a favor and just tell me what is needed and i just start the job and see what happens .

thanks alot and im waiting

#2
WingedPanther

WingedPanther

    A spammer's worst nightmare

  • Moderators
  • 16,831 posts
I would start by looking at some of the open source PDF viewers. That should help you get an idea of how they work. Second, I would look at converting to an ODF format, which supports right to left languages.
Programming is a branch of mathematics.
My CodeCall Blog | My Personal Blog

#3
Master

Master

    Newbie

  • Members
  • PipPip
  • 13 posts
tanx , ive give the first recommendation a try already but i have no idea what the ODF is /// would you mind explaining it abit more ?or just give me some principles concerning it

#4
WingedPanther

WingedPanther

    A spammer's worst nightmare

  • Moderators
  • 16,831 posts
ODF is the OpenDocument Format. It's based on XML, and is used by programs such as OpenOffice. You should be able to get basic specs on it fairly easily with a Google search.
Programming is a branch of mathematics.
My CodeCall Blog | My Personal Blog

#5
Master

Master

    Newbie

  • Members
  • PipPip
  • 13 posts
tanx pal . im on it

#6
dargueta

dargueta

    Writes binary right handed and hex left handed

  • Moderators
  • 4,720 posts
Another good website you can look at for file specs is Wotsit.org. Here are some results for PDF file formats:

PDF File Format
More PDF File Format Specs