Document images are documents that normally begin on paper and are then via electronics scanned. These documents have rich internal structure and might only be available in image form. Supplementally, they may have been created by a union of printing technologies (or by handwriting); and include diagrams, tables, graphics and other non-textual component. Large collections of such complex documents are commonly found in legal investigation. Many approaches come in for indexing and retrieval document images. In this paper we proposed a framework for classify document image retrieval approaches, and then we evaluated these approaches based on important measures.
Keywords
Information Retrieval, Indexing, Document Image, Machine-print, Handwriting
User
Information