Intelligent Document Capture with Ephesoft
By Pat Myers, Ike Kavas, Clifford Laurin and Michael Müller
()
About this ebook
Pat Myers
Pat Myers is the executive vice president and a cofounder of Zia Consulting, a content-centric solutions firm. Zia is a platinum Ephesoft and Alfresco partner that provides solutions from paper to mobile. Zia was the Ephesoft Partner of the Year in 2012, 2013, and 2014. Pat has over 14 years of Enterprise Content Management (ECM) experience and 18 years of experience in professional services and application development. Pat and Ike Kavas developed the first official Ephesoft training, and Pat is the coauthor of Intelligent Document Capture with Ephesoft, First Edition.
Related to Intelligent Document Capture with Ephesoft
Related ebooks
Intelligent Document Capture with Ephesoft - Second Edition Rating: 0 out of 5 stars0 ratingsHow to successfully implement an ERP Rating: 0 out of 5 stars0 ratingsMicrosoft Certified Database Administrator A Complete Guide - 2020 Edition Rating: 0 out of 5 stars0 ratingsData Marts A Complete Guide - 2021 Edition Rating: 0 out of 5 stars0 ratingsMaking Technology Investments Profitable: ROI Road Map from Business Case to Value Realization Rating: 3 out of 5 stars3/5Insurance Policy Management A Complete Guide - 2019 Edition Rating: 0 out of 5 stars0 ratingsCost Capacity Planning A Complete Guide - 2021 Edition Rating: 0 out of 5 stars0 ratingsAPIs A Complete Guide - 2021 Edition Rating: 0 out of 5 stars0 ratingsMicrosoft Dynamics CRM Online A Complete Guide - 2021 Edition Rating: 0 out of 5 stars0 ratingsAmazon Redshift A Complete Guide - 2021 Edition Rating: 0 out of 5 stars0 ratingsIT Infrastructure Deployment A Complete Guide - 2020 Edition Rating: 0 out of 5 stars0 ratingsBusiness Relationship Management A Complete Guide - 2021 Edition Rating: 0 out of 5 stars0 ratingsSaaS CRM Standard Requirements Rating: 0 out of 5 stars0 ratingsCloud Environments A Complete Guide - 2021 Edition Rating: 0 out of 5 stars0 ratingsMicrosoft SQL Server 2008 R2 Master Data Services Rating: 0 out of 5 stars0 ratingsPodman in Action: Secure, rootless containers for Kubernetes, microservices, and more Rating: 0 out of 5 stars0 ratingsElectronic Document Management Systems A Complete Guide - 2021 Edition Rating: 0 out of 5 stars0 ratingsData Hub Architecture The Ultimate Step-By-Step Guide Rating: 0 out of 5 stars0 ratingsArtificial Intelligence in Program and Project Management Rating: 0 out of 5 stars0 ratingsAutomation A Complete Guide - 2021 Edition Rating: 0 out of 5 stars0 ratingsSingle Sign On A Complete Guide - 2021 Edition Rating: 0 out of 5 stars0 ratingsIT Infrastructure And Business Application Monitoring A Complete Guide - 2021 Edition Rating: 0 out of 5 stars0 ratingsIT Risk Management A Complete Guide - 2021 Edition Rating: 0 out of 5 stars0 ratingsRequirement Analysis A Complete Guide - 2020 Edition Rating: 0 out of 5 stars0 ratingsExcel: A Step-by-Step Guide with Practical Examples to Master Excel's Basics, Functions, Formulas, Tables, and Charts Rating: 0 out of 5 stars0 ratingsProcess Mining: Fundamentals and Applications Rating: 0 out of 5 stars0 ratingsDynamics 365 Field Service: Implementing Business Solutions for the Enterprise Rating: 0 out of 5 stars0 ratings(Excerpts From) Investigating Performance: Design and Outcomes With Xapi Rating: 0 out of 5 stars0 ratingsIBM Cognos 8 Report Studio Cookbook Rating: 0 out of 5 stars0 ratings
Computers For You
Standard Deviations: Flawed Assumptions, Tortured Data, and Other Ways to Lie with Statistics Rating: 4 out of 5 stars4/5The Hacker Crackdown: Law and Disorder on the Electronic Frontier Rating: 4 out of 5 stars4/5101 Awesome Builds: Minecraft® Secrets from the World's Greatest Crafters Rating: 4 out of 5 stars4/5Slenderman: Online Obsession, Mental Illness, and the Violent Crime of Two Midwestern Girls Rating: 4 out of 5 stars4/5Master Builder Roblox: The Essential Guide Rating: 4 out of 5 stars4/5The ChatGPT Millionaire Handbook: Make Money Online With the Power of AI Technology Rating: 0 out of 5 stars0 ratingsElon Musk Rating: 4 out of 5 stars4/5CompTIA IT Fundamentals (ITF+) Study Guide: Exam FC0-U61 Rating: 0 out of 5 stars0 ratingsSQL QuickStart Guide: The Simplified Beginner's Guide to Managing, Analyzing, and Manipulating Data With SQL Rating: 4 out of 5 stars4/5Ultimate Guide to Mastering Command Blocks!: Minecraft Keys to Unlocking Secret Commands Rating: 5 out of 5 stars5/5The Invisible Rainbow: A History of Electricity and Life Rating: 4 out of 5 stars4/5Procreate for Beginners: Introduction to Procreate for Drawing and Illustrating on the iPad Rating: 0 out of 5 stars0 ratingsMastering ChatGPT: 21 Prompts Templates for Effortless Writing Rating: 5 out of 5 stars5/5Learning the Chess Openings Rating: 5 out of 5 stars5/5Everybody Lies: Big Data, New Data, and What the Internet Can Tell Us About Who We Really Are Rating: 4 out of 5 stars4/5Alan Turing: The Enigma: The Book That Inspired the Film The Imitation Game - Updated Edition Rating: 4 out of 5 stars4/5User Friendly: How the Hidden Rules of Design Are Changing the Way We Live, Work, and Play Rating: 4 out of 5 stars4/5Storytelling with Data: Let's Practice! Rating: 4 out of 5 stars4/5CompTIA Security+ Practice Questions Rating: 2 out of 5 stars2/5Deep Search: How to Explore the Internet More Effectively Rating: 5 out of 5 stars5/5AP® Computer Science Principles Crash Course Rating: 0 out of 5 stars0 ratingsGarageBand Basics: The Complete Guide to GarageBand: Music Rating: 0 out of 5 stars0 ratingsDark Aeon: Transhumanism and the War Against Humanity Rating: 5 out of 5 stars5/5The Professional Voiceover Handbook: Voiceover training, #1 Rating: 5 out of 5 stars5/5
Reviews for Intelligent Document Capture with Ephesoft
0 ratings0 reviews
Book preview
Intelligent Document Capture with Ephesoft - Pat Myers
Table of Contents
Intelligent Document Capture with Ephesoft
Credits
Foreword
About the Authors
About the Reviewers
www.PacktPub.com
Support files, eBooks, discount offers and more
Why Subscribe?
Free Access for Packt account holders
Preface
What this book covers
What you need for this book
Who this book is for
Conventions
Reader feedback
Customer support
Errata
Piracy
Questions
1. Introduction
Overview
History of document capture
Elements of document capture projects
The destination of the documents
The location of the documents and systems
The type of documents to be processed
Inside a capture system
Import methods
Classification methods
Extraction methods
Export methods
What sets document capture apart from the other ECM tools?
Summary
2. A Quick Tour of Ephesoft
The administrative user interface
Batch Class Management
Batch Instance Management
Workflow Management
Folder Management
Reports
Operator user interface
Batch List
Batch Detail
Web Scanner
Upload Batch
File system
Summary
3. Creating a Batch Class
Copying an existing batch class
Creating new document types
System training for classification and separation
Creating fields
Basic Key/Value extraction
Regular expression listing
Export
Copy Batch XML
Summary
4. Processing a Batch
Starting a batch
Document review
Document validation
Summary
5. Core Ephesoft Features
Classification
Classification types
Search
Image
Barcodes
Automatic
Confidence
Search classification
Barcode classification
Image classification
Automatic classification
Programmatic classification
Multiple layouts for a single document type
Advanced KV extraction
Fuzzy DB
Using the Web Scanner
Uploading batches
Export
CMIS Export
Establish a content model in your CMS
Configure Ephesoft CMIS Export
Configure the CMIS Export plugin
Document type and property mapping
Global CMIS configuration
Database Export
Other export plugins
Configuration management of batch classes
Summary
6. Ephesoft Extended Features
Other classification methods
Barcode classification
Image classification
RecoStar extraction
Create a RecoStar project
Configure the RecoStar project
Configure Ephesoft to use the RecoStar project
Table extraction
Scripting
Workflow scripts
Triggering scripts when a field is edited
Triggering scripts from a function key
External application integration
Configuring external application references
External application context
Security tickets
Returning document metadata to Ephesoft
Signalling dialog close and save events
Custom workflow
Adding a plugin to Ephesoft
Customizing batch class workflows
Writing a custom plugin
Grid computing
Automatic learning
Summary
7. Tips
Troubleshooting
Logging
Monitoring batch progress
Examining the batch file
Restarting the batch
No blank forms for training
Setting up Active Directory
Directory server overview
LDAP
Ephesoft LDAP connector configuration
Ephesoft directory server service account
Ephesoft directory server groups
Ephesoft LDAP configuration files
dcma.xml
user-connectivity.xml
web.xml
Troubleshooting
Setting up e-mail processing
Summary
A. Reference
Glossary
Common regular expressions
Commonly modified Ephesoft settings
dcma-workflow.properties
dcma-batch.properties
dcma-cmis.properties
Index
Intelligent Document Capture with Ephesoft
Intelligent Document Capture with Ephesoft
Copyright © 2012 Packt Publishing
All rights reserved. No part of this book may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, without the prior written permission of the publisher, except in the case of brief quotations embedded in critical articles or reviews.
Every effort has been made in the preparation of this book to ensure the accuracy of the information presented. However, the information contained in this book is sold without warranty, either express or implied. Neither the authors, nor Packt Publishing, and its dealers and distributors will be held liable for any damages caused or alleged to be caused directly or indirectly by this book.
Packt Publishing has endeavored to provide trademark information about all of the companies and products mentioned in this book by the appropriate use of capitals. However, Packt Publishing cannot guarantee the accuracy of this information.
First published: September 2012
Production Reference: 1060912
Published by Packt Publishing Ltd.
Livery Place
35 Livery Street
Birmingham B3 2PB, UK.
ISBN 978-1-84969-372-1
www.packtpub.com
Cover Image by iStockPhoto
Credits
Authors
Pat Myers
Ike Kavas
Michael Muller
Clifford Laurin
Reviewers
Eric Harper
Megan Hoffman
Anita L. Feeley
Alicia Libucha
Acquisition Editor
Mary Jasmine Nadar
Lead Technical Editor
Mary Jasmine Nadar
Technical Editor
Jalasha D'costa
Project Coordinator
Sai Gamare
Proofreader
Maria Gould
Indexer
Hemangini Bari
Graphics
Aditi Gajjar
Production Coordinator
Shantanu Zagade
Cover Work
Shantanu Zagade
Foreword
In my recent e-book #OccupyIT: A Technology Manifesto for the Cloud, Mobile, and Social Era (http://www.aiim.org/occupyIT), I talk about the revolutionary changes that are impacting how we make enterprise technology decisions.
On the one hand, we have the business,
awed and impressed by the changes and speed of implementation in the consumer technology space (think Facebook, Google, Twitter), asking their IT departments why enterprise technology has to be so old fashioned,
why implementation needs to take so long, and why enterprise technology has to be so darn expensive.
On the other hand, we have IT
, struggling to maintain order amidst the chaos, and struggling with expectations from the business
that are escalating exponentially. IT spending by IT is flat, while IT spending by the business
is increasing significantly. Clearly the traditional world of enterprise IT is changing.
In many ways, the cloud and open source revolutions are two sides of the same coin. They stem from the desire to buy technology by the glass,
to buy technology in which the release cycles are frequent and manageable rather than long and frightening, and you can try before you buy
(and especially before you scale!).
According to a recent global CIO survey, 60 percent of organizations are ready to embrace cloud computing over the next five years as a means of growing their businesses and achieving a competitive advantage. The figure is nearly twice the number of CIOs who said they would utilize the cloud in the previous study.
The impact of the cloud and open source, though, will be massive beyond the immediate revenues that will be classified in industry studies as cloud and open source because they fundamentally change the way we look at IT services, how we pay for these services within our organization (capital spending versus operating), and how we view upgrade paths (and who is responsible for these upgrades). Organizations that do not incorporate rapid and flexible implementation and adoption models into their thinking do so at their own peril.
This frame of flexibility and rapid deployment is how we need to think about an aspect of the content management industry that has been with us for a long time; capture.
No matter how elegant the frontend, Systems of Engagement (for a white paper on this, see http://aiim.org/futurehistory) cannot operate in an environment in which the processes that support and complement these Systems of Engagement are engulfed by paper and inefficiency. The reality is that most organizations exist in a hybrid environment in which process information may come from paper documents, paper forms, web forms, faxes, telephony, e-mails, SMS, mobile, and social.
Automated capture of information as early as possible in the business process and as close to the point of origination produces cleaner data, resulting in higher quality information, less exception handling, and better process management. The more important the process is to a business, the greater the impact such improvements will have. Once paper-based information moves into the digital realm it can be used to enrich social and mobile applications. In paper form, that information might as well not exist since no one can get to it without great effort.
The reality that exists in most organizations suggests that although capture and its associated technologies are mature technologies, the market and the scale of implementation is anything but.
According to a recent AIIM study (Automating Financial Processes: User Feedback on the Real ROI), the average cost to process a paper invoice is still more than $9. Overall, 52 percent of organizations surveyed have yet to adopt any automated AP systems. One third of organizations receiving more than 25,000 invoices per month are still using paper-based processes.
These findings were reaffirmed in a follow-up AIIM survey (Process Revolution: Moving Your Business from Paper to PC to Tablet). A third of small and mid-sized companies and 22 percent of the largest have yet to adopt any paper-free processes. Only 20 percent of organizations of any size proactively evaluate all processes for driving out paper. The percentage of processes that could be paper free is actually only 14 percent. Seventy-seven percent of invoices that arrive as PDF attachments get printed. Thirty-one percent of faxed invoices get printed and scanned back in.
I could go on and on. Perhaps the most astonishing thing about all of this is how compelling the ROI actually is for scanning and capture – once people can be convinced to make the jump.
Per Process Revolution, on average respondents using scanning and capture consider that it improves the speed of response to customers, suppliers, citizens, or staff by six times or more. Seventy percent estimate an improvement of at least three times, and nearly a third (29 percent) sees an improvement of 10 times or more. Forty-two percent of users have achieved a payback period of 12 months or less from their scanning and capture investments. Fifty-seven percent are posting a payback of 18 months or less.
So the opportunity is there. I am convinced we can all do a better job of educating decision-makers about new cloud and open source models for delivering capture and content management technologies. I am also convinced that we can all do a better job of educating decision-makers about the benefits of capture and how to implement capture systems quickly and effectively. Hence, my great pleasure in writing the foreword to this book.
John Mancini,
Author, Speaker, and President of AIIM
About the Authors
Pat Myers is the Executive Vice President and a co-founder of Zia Consulting, a content centric solutions firm. Zia is a platinum Ephesoft and Alfresco partner that provides solutions from paper to mobile. Pat has over 10 years of Enterprise Content Management experience and 15 years of professional services and application development experience. Pat and Ike developed the official Ephesoft training.
I would like to thank my wife Margaret for giving me unconditional love and encouragement in everything I do, my daughter Zoe for making me remember what is important in life, and my God for giving me so many opportunities. Additionally, I would like to thank my extended family and friends for making my life so enjoyable. I would also like to thank my Zia family for making me want to go to work every day and achieve greatness.
Ike Kavas has more than 12 years of solid experience in document imaging, document management, workflow, and systems. Mr. Kavas is the founder and the Chief Technology Officer at