About 1,030,000 results
Open links in new tab
  1. Structure of a PDF file? - Stack Overflow

    Sep 18, 2008 · Here's the raw reference of PDF 1.7, and here's an article describing the structure of a PDF file. If you use Vim, the pdftk plugin is a good way to explore the document in an ever-so-slightly …

  2. How can I visually inspect the structure of a PDF to reverse engineer ...

    How can I inspect the structure of PDF files? Use case: I'm trying to programmatically generate PDF files (using iText). I'm having trouble achieving certain layouts, but I have PDF files with tex...

  3. How to extract data from a PDF file while keeping track of its structure?

    Jun 2, 2009 · My objective is to extract the text and images from a PDF file while parsing its structure. The scope for parsing the structure is not exhaustive; I only need to be able to identify headings and

  4. How can I extract tables as structured data from PDF documents?

    Interesting thing I came across: I parse a PDF that clearly looks like it's been generated from html/word document to pdf. When I export it from Acrobat Pro to Word document the table formatting is 100% …

  5. Invalid pdf structure issue in React project.(I am using pdfjs)

    Sep 23, 2023 · 0 I am using pdfjs to read pdf files in react app. I tested several pdf files but the browser still shows "Invalid pdf structure" message. This is the code which I tried.

  6. adobe - How to open PDF raw? - Stack Overflow

    Jul 3, 2011 · The commandline to achieve this is: qpdf --qdf original.pdf unpacked.pdf Another useful and free tool (GPL licensed, but Linux-only AFAIK) to look into PDFs is of course PDFEdit. This one …

  7. The internal structure of PDF file - Stack Overflow

    Feb 25, 2016 · I am studying the structure of a PDF file. Please do so using the PDF specification ISO 32000-1. How can I determine the info like paper size, Each page has its own size, it is defined in the …

  8. PDF file structure: what's the HTML code for? - Stack Overflow

    Jul 1, 2020 · According to the PDF standard (ISO 32000-2, similarly also already in ISO 32000-1): The PDF file begins with the 5 characters “%PDF–” (ISO 32000-2, section 7.5.2 "File header") Acrobat …

  9. How to extract text and text coordinates from a PDF file?

    I want to extract all the text boxes and text box coordinates from a PDF file with PDFMiner. Many other Stack Overflow posts address how to extract all text in an ordered fashion, but how can I do ...

  10. Issue with including a JPEG-data-structure into to PDF-code as an ...

    Dec 27, 2024 · Since this JPEG file is shown as expected (e.g, by means of IrfanView) , I think that the the byte-sequence between position 1317 and 7355 in the target-PDF should represent at least a …