Sample Image OCR and Index Information Files for providing to Scanning Outsourcers - Support

Sample Image OCR and Index Information Files for providing to Scanning Outsourcers

From Support

Jump to: navigation, search

The Image Server/400 and WebDocs Batch Import processes need 3 files for input potentially. This depends on what type of information is being scanned and batch imported into the system.

Each file prefix must match so that the batch import process knows how to keep them together. There is no rule for file naming other than the fact that the .TIF, .OCR and .RDX must all start with the same prefix for each document.

When using imaging outsource providers, the following files may be provided as results from the outsource provider.

1.) Image File - Required: A TIF or PDF based scanned image document. File Name Ex: (IMAGE1.TIF) See attached sample TIF file.

2.) Basic Index Property File - Required: A Windows INI formatted file containing the basic index properties. File Name Ex: (IMAGE1.RDX) See attached sample RDX file. (Can be opened with Windows Notepad or other ASCII editor.)

The index properties are as follows:

[DocSettings]
DocFile=DOC-2003120909340707.TIF
DocFolder=TEST / /
DocType=AR
DocTitle=Invoice
DocIndex01=123456
DocIndex02=Joseph Gamba
DocIndex03=BL 03012
DocIndex04=1001
DocIndex05= 
DocIndex06= 
DocIndex07= 
DocIndex08= 
DocIndex09= 
DocIndex10= 

3.) Full text OCR File - Optional: A text, RTF or MS Word formatted document containing full page OCR information. This file is only required when doing full text indexing of documents. File Name Ex: (IMAGE1.OCR) See attached sample OCR file. (Can be opened with Windows Notepad or other ASCII editor.)

The OCR text can be unformatted or formatted as listed below:

Unformatted OCR Text Sample:

Invoice
Invoice Information

Software
Systems, Inc.
Invoice No:	BL 03012
	Invoice Date:	01/28/97
Order No:	1001

Ship To

Gamba,Joseph R.
1423 Willowbrook Lane
Rose Isle, FL
32789-0000

(Ship Date
Terms
Ship Via
Sales Person
Due Date
	01/28/97	2%lONet3O	UPS Ground	3	02/27/97
	3.00	3.00 MX100	349.00	Y	1047.00
U0	R	EasY~airI S	T F	D
	All Currency is in U.S. Dollars	 Subtotal	1047.00
		Sales Tax	94.23
	Notice On all Past Due Accounts 1.5%	Freight	0.00
Interest Charge Added Per Month
	Minimum Charge $10.00	Invoice Total	1141.23
K	Bill To
Gamba,Joseph R.
1423 Willowbrook Lane
Rose Isle, FL
32789-0000

Formatted OCR Text Sample:

$10.00
(ship
0.00
01/28/97
02/27/97
03012
1.5%
1001
1047.00
1141.23
1423
2%lonet3o
3.00
32789-0000
349.00
94.23
accounts
added
all
bill
bl
charge
currency
date
date:
dollars
due
easy~airi
fl
freight
gamba
ground
in
inc.
information
interest
invoice
is
isle
joseph
lane
minimum
month
mx100
no:
notice
on
order
past
per
person
r.
rose
sales
ship
software
subtotal
systems
tax
terms
to
total
u.s.
u0
ups
via
willowbrook


If you have questions on the information enclosed in this document, please contact RJS Software Systems Inc. (888) RJS-SOFT -or- sales@rjssoftware.com

Personal tools