The Resource Guide to OCR for Indic Scripts : Document Recognition and Retrieval, edited by Venu Govindaraju, Srirangaraj (Ranga) Setlur, (electronic resource)

Guide to OCR for Indic Scripts : Document Recognition and Retrieval, edited by Venu Govindaraju, Srirangaraj (Ranga) Setlur, (electronic resource)

Label
Guide to OCR for Indic Scripts : Document Recognition and Retrieval
Title
Guide to OCR for Indic Scripts
Title remainder
Document Recognition and Retrieval
Statement of responsibility
edited by Venu Govindaraju, Srirangaraj (Ranga) Setlur
Contributor
Editor
Editor
Subject
Language
  • eng
  • eng
Summary
Optical Character Recognition (OCR) is a key enabling technology critical to creating indexed, digital library content, and it is especially valuable for Indic scripts, for which there has been very little digital access. Indic scripts, the ancient Brahmi scripts prevalent in the Indian subcontinent, present some challenges for OCR that are different from those faced with Latin and Oriental scripts. But properly utilized, OCR will help to make Indic digital archives practically accessible to researchers and lay users alike by creating searchable indexes and machine-readable text repositories. This unique guide/reference is the very first comprehensive book on the subject of OCR for Indic scripts, providing an overview of the state-of-the-art research in this field as well as other issues related to facilitating query and retrieval of Indic documents from digital libraries. All major research groups working in this area are represented in this book, which is divided into sections on recognition of Indic scripts and retrieval of Indic documents. Topics and features: Contains contributions from the leading researchers in the field Discusses data set creation for OCR development Describes OCR systems that cover eight different scripts: Bangla, Devanagari, Gurmukhi, Gujarati, Kannada, Malayalam, Tamil, and Urdu (Perso-Arabic) Explores the challenges of Indic script handwriting recognition in the online domain Examines the development of handwriting-based text input systems Describes ongoing work to increase access to Indian cultural heritage materials Provides a section on the enhancement of text and images obtained from historical Indic palm leaf manuscripts Investigates different techniques for word spotting in Indic scripts Reviews mono-lingual and cross-lingual information retrieval in Indic languages This is an excellent reference for researchers and graduate students studying OCR technology and methodologies. This volume will contribute to opening up the rich Indian cultural heritage embodied in millions of ancient and contemporary documents spanning topics such as science, literature, medicine, astronomy, mathematics and philosophy. Venu Govindaraju FIEEE FIAPR, is a Distinguished Professor of Computer Science and Engineering at the University at Buffalo. He has over 20 years of research experience in pattern recognition, information retrieval and biometrics. His seminal work on handwriting recognition was at the core of the first handwritten address interpretation system used by the U.S. Postal Service. Srirangaraj Setlur SMIEEE, is a Principal Research Scientist at the University at Buffalo. He has over 15 years of research experience in pattern recognition that includes NSF sponsored work on multilingual OCR technologies for digital libraries and other applications. His work on postal automation has led to technology adopted by the U.S. Postal Service, and Royal Mail in the U.K
Member of
Is Subseries of
Dewey number
  • 006.4/24
  • 006.424
http://bibfra.me/vocab/relation/httpidlocgovvocabularyrelatorsedt
  • hBWtAzaV0pI
  • BPijhtcF6-8
Language note
English
LC call number
QA76.9.N38
Literary form
non fiction
Nature of contents
dictionaries
http://library.link/vocab/relatedWorkOrContributorName
  • Govindaraju, Venu.
  • Setlur, Srirangaraj (Ranga).
Series statement
Advances in Computer Vision and Pattern Recognition,
http://library.link/vocab/subjectName
  • Natural language processing (Computer science)
  • Natural Language Processing (NLP)
Label
Guide to OCR for Indic Scripts : Document Recognition and Retrieval, edited by Venu Govindaraju, Srirangaraj (Ranga) Setlur, (electronic resource)
Instantiates
Publication
Note
Description based upon print version of record
Bibliography note
Includes bibliographical references and index
Carrier category
online resource
Carrier category code
  • cr
Content category
text
Content type code
  • txt
Contents
Section: Recognition of Indic scripts -- Building Data Sets for Indian Language OCR Research -- On OCR of Major Indian Scripts: Bangla and Devanagari -- A Complete Machine-Printed Gurmukhi OCR System -- Progress in Gujarati Document Processing and Character Recognition -- Design of a Bilingual Kannada–English OCR -- Recognition of Malayalam Documents -- A Complete OCR System for Tamil Magazine Documents -- Experiments on Urdu Text Recognition -- The BBN Byblos Hindi OCR System -- Generalization of Hindi OCR Using Adaptive Segmentation and Font Files -- Online Handwriting Recognition for Indic Scripts -- Section: Retrieval of Indic documents -- Enhancing Access to Primary Cultural Heritage Materials of India -- Digital Image Enhancement of Indic Historical Manuscripts -- GFG-Based Compression and Retrieval of Document Images in Indian Scripts -- Word Spotting for Indic Documents to Facilitate Retrieval -- Indian Language Information Retrieval
Dimensions
unknown
Edition
1st ed. 2010.
Extent
1 online resource (333 p.)
Form of item
online
Isbn
9786612361913
Media category
computer
Media type code
  • c
Other control number
10.1007/978-1-84800-330-9
Specific material designation
remote
System control number
  • (CKB)1000000000798360
  • (EBL)993166
  • (OCoLC)809769577
  • (SSID)ssj0000298031
  • (PQKBManifestationID)11238787
  • (PQKBTitleCode)TC0000298031
  • (PQKBWorkID)10344051
  • (PQKB)10902099
  • (SSID)ssj0000770495
  • (PQKBManifestationID)12386879
  • (PQKBTitleCode)TC0000770495
  • (PQKBWorkID)10786089
  • (PQKB)11666867
  • (DE-He213)978-1-84800-330-9
  • (MiAaPQ)EBC993166
  • (EXLCZ)991000000000798360
Label
Guide to OCR for Indic Scripts : Document Recognition and Retrieval, edited by Venu Govindaraju, Srirangaraj (Ranga) Setlur, (electronic resource)
Publication
Note
Description based upon print version of record
Bibliography note
Includes bibliographical references and index
Carrier category
online resource
Carrier category code
  • cr
Content category
text
Content type code
  • txt
Contents
Section: Recognition of Indic scripts -- Building Data Sets for Indian Language OCR Research -- On OCR of Major Indian Scripts: Bangla and Devanagari -- A Complete Machine-Printed Gurmukhi OCR System -- Progress in Gujarati Document Processing and Character Recognition -- Design of a Bilingual Kannada–English OCR -- Recognition of Malayalam Documents -- A Complete OCR System for Tamil Magazine Documents -- Experiments on Urdu Text Recognition -- The BBN Byblos Hindi OCR System -- Generalization of Hindi OCR Using Adaptive Segmentation and Font Files -- Online Handwriting Recognition for Indic Scripts -- Section: Retrieval of Indic documents -- Enhancing Access to Primary Cultural Heritage Materials of India -- Digital Image Enhancement of Indic Historical Manuscripts -- GFG-Based Compression and Retrieval of Document Images in Indian Scripts -- Word Spotting for Indic Documents to Facilitate Retrieval -- Indian Language Information Retrieval
Dimensions
unknown
Edition
1st ed. 2010.
Extent
1 online resource (333 p.)
Form of item
online
Isbn
9786612361913
Media category
computer
Media type code
  • c
Other control number
10.1007/978-1-84800-330-9
Specific material designation
remote
System control number
  • (CKB)1000000000798360
  • (EBL)993166
  • (OCoLC)809769577
  • (SSID)ssj0000298031
  • (PQKBManifestationID)11238787
  • (PQKBTitleCode)TC0000298031
  • (PQKBWorkID)10344051
  • (PQKB)10902099
  • (SSID)ssj0000770495
  • (PQKBManifestationID)12386879
  • (PQKBTitleCode)TC0000770495
  • (PQKBWorkID)10786089
  • (PQKB)11666867
  • (DE-He213)978-1-84800-330-9
  • (MiAaPQ)EBC993166
  • (EXLCZ)991000000000798360

Library Locations

  • Architecture LibraryBorrow it
    Gould Hall 830 Van Vleet Oval Rm. 105, Norman, OK, 73019, US
    35.205706 -97.445050
  • Bizzell Memorial LibraryBorrow it
    401 W. Brooks St., Norman, OK, 73019, US
    35.207487 -97.447906
  • Boorstin CollectionBorrow it
    401 W. Brooks St., Norman, OK, 73019, US
    35.207487 -97.447906
  • Chinese Literature Translation ArchiveBorrow it
    401 W. Brooks St., RM 414, Norman, OK, 73019, US
    35.207487 -97.447906
  • Engineering LibraryBorrow it
    Felgar Hall 865 Asp Avenue, Rm. 222, Norman, OK, 73019, US
    35.205706 -97.445050
  • Fine Arts LibraryBorrow it
    Catlett Music Center 500 West Boyd Street, Rm. 20, Norman, OK, 73019, US
    35.210371 -97.448244
  • Harry W. Bass Business History CollectionBorrow it
    401 W. Brooks St., Rm. 521NW, Norman, OK, 73019, US
    35.207487 -97.447906
  • History of Science CollectionsBorrow it
    401 W. Brooks St., Rm. 521NW, Norman, OK, 73019, US
    35.207487 -97.447906
  • John and Mary Nichols Rare Books and Special CollectionsBorrow it
    401 W. Brooks St., Rm. 509NW, Norman, OK, 73019, US
    35.207487 -97.447906
  • Library Service CenterBorrow it
    2601 Technology Place, Norman, OK, 73019, US
    35.185561 -97.398361
  • Price College Digital LibraryBorrow it
    Adams Hall 102 307 West Brooks St., Norman, OK, 73019, US
    35.210371 -97.448244
  • Western History CollectionsBorrow it
    Monnet Hall 630 Parrington Oval, Rm. 300, Norman, OK, 73019, US
    35.209584 -97.445414
Processing Feedback ...