User Tools

Site Tools


software:pdfprocessing

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
software:pdfprocessing [2011/08/07 17:22] – [Workflow] adminsoftware:pdfprocessing [2011/08/22 22:28] (current) admin
Line 1: Line 1:
 ====== Processing sheetmusic ====== ====== Processing sheetmusic ======
 During a period of one year I have experimented a lot with scanning sheetmusic and converting that fast into high quality pdf files. The experiments resulted in following useful software: During a period of one year I have experimented a lot with scanning sheetmusic and converting that fast into high quality pdf files. The experiments resulted in following useful software:
-  * [[software:pdfprocessing:FlattenANDCompactPDF_A4Size|FlattenANDCompactPDF_A4Size]] This program is useful for autonomously and accurately converting any pdf or tif file to a compact pdf file containing only black and white bitmaps and fitting exactly to a4. It extensively analyzes all pdf pages within the source file before any processing is done to compensate for possible rasterization rounding errors. This will guarantee that an a4 sized page in a pdf file with a bitmap raster of 2480x3508 pixels will remain the same size. If one or more page(s) are larger, it will be reduced to a4, in such a manner that the largest page determines the scale factor for all pages. Smaller bitmaps are centered on a page, with whitespace around it. So called portrait and landscape sizes are also automatically recognized. The output resolution is per default 300dpi but can be set to any other value. See the code for instructions.\\ Another useful purpose of this software is to combine layers within a pdf to a single layer, which happen when annotating has been done into the pdf. An extra layer makes the file larger in size, which can be merged into one layer with this software.+  * [[software:pdfprocessing:MakePDF|MakePDF]] This program is useful for autonomously and accurately converting any pdf or tiff file to a compact pdf file containing only black and white bitmaps and fitting exactly to a4. It extensively analyzes all pdf pages within the source file before any processing is done to compensate for possible rasterization rounding errors. This will guarantee that an a4 sized page in a pdf file with a bitmap raster of 2480x3508 pixels will remain the same size. If one or more page(s) are larger, it will be reduced to a4, in such a manner that the largest page determines the scale factor for all pages. Smaller bitmaps are centered on a page, with whitespace around it. So called portrait and landscape sizes are also automatically recognized. The output resolution is per default 300dpi but can be set to any other value. See the code for instructions.\\ Another useful purpose of this software is to combine layers within a pdf to a single layer, which happen when annotating has been done into the pdf. An extra layer makes the file larger in size, which can be merged into one layer with this software.
   *  [[software:pdfprocessing:pdf2tif_300dpi|pdf2tif_300dpi]] This program is useful for accurately converting a pdf file to a series of tiff files (one file for each page), at a resolution which is specified in the filename and compensate for possible rasterization rounding errors. The format of the generated tiff files are specially encoded to be used with the program [[http://scantailor.sourceforge.net/|scantailor]]. (So a fix dpi can be omitted).   *  [[software:pdfprocessing:pdf2tif_300dpi|pdf2tif_300dpi]] This program is useful for accurately converting a pdf file to a series of tiff files (one file for each page), at a resolution which is specified in the filename and compensate for possible rasterization rounding errors. The format of the generated tiff files are specially encoded to be used with the program [[http://scantailor.sourceforge.net/|scantailor]]. (So a fix dpi can be omitted).
  
Line 9: Line 9:
   - Acquire a fast and good scanner (I had an Epson GT-10000, later on succeeded with an Epson GT-30000 which was even much faster. The GT-30000 is faster than I can turn pages, so I don't have to wait for that device.)   - Acquire a fast and good scanner (I had an Epson GT-10000, later on succeeded with an Epson GT-30000 which was even much faster. The GT-30000 is faster than I can turn pages, so I don't have to wait for that device.)
   - Use [[http://www.hamrick.com/|Vuescan]] to scan automatically each 6 seconds a single a3 tiff page at 300 dpi 8-bit grayscale. Alternatively, if I receive from someone an unprocessed pdf file, I use [[software:pdfprocessing:pdf2tif_300dpi|pdf2tif_300dpi]] to convert it to single tiff pages with proper resolution and grayscaled.   - Use [[http://www.hamrick.com/|Vuescan]] to scan automatically each 6 seconds a single a3 tiff page at 300 dpi 8-bit grayscale. Alternatively, if I receive from someone an unprocessed pdf file, I use [[software:pdfprocessing:pdf2tif_300dpi|pdf2tif_300dpi]] to convert it to single tiff pages with proper resolution and grayscaled.
-  - Use [[http://scantailor.sourceforge.net/|scantailor]] to rotate, straighten, remove borders, filter and convert to black and white bitmaps. I tested extensively other software like [[software:unpaper_test|unpaper]] but I found that scantailor performs way faster and much more accurate then unpaper. Alternatively I have also experimented a lot with using the image processing features of [[http://www.neuratron.com/photoscore.htm|Neuratron Photoscore]] but found that scantailor is still much faster and gives better handling to fine-tune the output. +  - Use [[http://scantailor.sourceforge.net/|scantailor]] to rotate, straighten, remove borders, filter and convert to black and white bitmaps. I tested extensively other software like [[software:unpaper_test|unpaper]] but I found that scantailor performs way faster and much more accurate then unpaper. Alternatively I have also experimented a lot with using the image processing features of [[http://www.neuratron.com/photoscore.htm|Neuratron Photoscore]] controlled externally by [[software:musicpdf|a script]], but found that scantailor is still much faster and gives better handling to fine-tune the output. 
-  - Use [[software:pdfprocessing:FlattenANDCompactPDF_A4Size|FlattenANDCompactPDF_A4Size]] to autonomously resize and/or convert the previously generated tiff files to an a4 sized pdf file with exactly 2480x3508 (or 3508x2480) pixels with g4 encoding((I experienced that other, more efficient types of lossless coding may be better, but will put a much heavier load on the processor while viewing pdf files. This adversely influenced a smooth user experience.)). Select the tiff files, drag and drop them onto the exe file. The encoding results in pdf pages with an average of 60 kB per page.+  - Use [[software:pdfprocessing:MakePDF|MakePDF]] to autonomously resize and/or convert the previously generated tiff files to an a4 sized pdf file with exactly 2480x3508 (or 3508x2480) pixels with g4 encoding((I experienced that other, more efficient types of lossless coding may result in smaller files, but will put a much heavier load on the processor while viewing pdf files. This adversely influenced a smooth user experience.)). Select the tiff files, drag and drop them onto the exe file. The encoding results in pdf pages with an average of 60 kB per page
 + 
 +  * The MakePDF program accepts apart from tiff files also pdf files. This can be very useful if you have to convert one or more pdf file(s) with pages which are too small or too large for a pdf file with a4 sized pages. 
 +  * To correct wrong placing of odd and even pages, you can enable the -q option (by renaming the MakePDF.exe to MakePDF -q.exe). What it will do is add at the end of the document as many pages as necessary to create a document with a multiple of 4 pages. This can specifically be practical if your target format is a booklet. As a last step, empty pages need to be moved manually to the right location. 
 +  * Annotations: Use [[http://www.ograhl.com/en/pdfannotator/|PDF Annotator]] with a pen tablet((I used a Wacom Intuous3 a5 tablet)) to write instructions (like bowings) directly into the pdf file. Afterwards use [[software:pdfprocessing:MakePDF|MakePDF]] a second time to merge these annotations with the music into one bitmap and produce again a compact pdf file. 
 +===== Webcast ===== 
 +Please see [[http://www.auditeon.com/xyz/webcast/ScanSheetmusicDemo.htm|here]] for a webcast, using the workflow from above.
  
-  * The FlattenANDCompactPDF_A4Size program accepts apart from tiff files also pdf files. This can be very useful if you have to convert one or more pdf file(s) with pages which are too small or too large for a pdf file with a4 sized pages. 
-  * To correct wrong placing of odd and even pages, you can enable the -q option (by renaming the FlattenANDCompactPDF_A4Size.exe to FlattenANDCompactPDF_A4Size -q.exe). What it will do is add at the end of the document as many pages as necessary to create a document with a multiple of 4 pages. This can specifically be practical if your target format is a booklet. As a last step, empty pages need to be moved manually to the right location. 
-  * Annotations: Use [[http://www.ograhl.com/en/pdfannotator/|PDF Annotator]] with a pen tablet((I used a Wacom Intuous3 a5 tablet)) to write instructions (like bowings) directly into the pdf file. Afterwards use [[software:pdfprocessing:FlattenANDCompactPDF_A4Size|FlattenANDCompactPDF_A4Size]] a second time to merge these annotations with the music into one bitmap and produce again a compact pdf file. 
software/pdfprocessing.1312730571.txt.gz · Last modified: 2011/08/07 17:22 by admin