Return to Product Page
Understanding Easy Mark Index
Creating output is a two step process, step one is preparing the images, OCR text files and the creation of a table with information relating to the images. With step two being moving the images into an output folder, stamping the images with a document number (if desired), creating a multi-page text file (if desired) and the creation of an output file which will be used to import the images.
Easy Index handles three types of job functions (Use Cover Pages, Import Single Page Tiffs and Import Multi- Pages Tiffs). Each of these three functions output the table "filelist.csv" and place it in the folder where the processed images reside along with single page OCR text files. The desired output of page numbering, format of text output, image stamping etc. is all based on the content of this table and is preformed in step 2.
Because of this once if a user understands the layout of the table they can modify it manually if necessary to change results. For instance an entire set of files can easily have an index value assigned to them by inserting a column in the table. A page can easily be moved or inserted by modifying the table.
The table format looks like this:
In Column A is the name of an image file. If Column B contains "STARTDOC" the file in Column A does not need to exist to create output. It means the file in Column 1 was a cover page that was read which contained indexing information which is now in Columns 4-9. Because of this, if tiff images were obtained with the import function there will never be values in the index columns unless manually entered.
Column 3 contains the document number if column 2 is "STARTDOC" or a page number if column 2 contains "PAGE". Columns 4 - 9 contain index information about the document. If a page needs to be inserted the user can add a line in the correct document, and rename the pages. When the output is created the OCR text file will not exist, so the program will create one automatically.
In the above table line 7 would be the first page of document number 2 which has indexing information containing Marketing, Manufacturing, Brochure, and Policy.