Professional Documents
Culture Documents
Agenda
Workflow, Digitization and Digitization Workflow Definitions Simple Workflow Systems Common Features Required in a Digitization Workflow DAFv2 Overview
Data model System Architecture System Modules Achieving Flexibility Using DWMS Adaptation of BA workflow to AMEEL
What is Digitization?
The conversion of data from analog to digital or binary. Data could be object, image, document or a signal (usually an analog signal)
What is a Workflow?
It is a process and/or procedure in which tasks are completed. (Wiktionary) A workflow is a reliably repeatable pattern of activity enabled by a systematic organization of resources, defined roles and mass, energy and information flows, into a work process that can be documented and learned. (Wikipedia) Examples?
5
What is a Workflow?
It is not found on the internet as a concept yet. A process and/or procedure in which tasks are completed to convert (data) to digital form for use on a computer. How to Digitize a book?
Digitization Workflow
Books Arrival
OCR
QA-PDF
Manual workflow management using several software packages MS Excel MS SharePoint MS Project Good for small digitization projects No installation time (Startup cost) Minimum extra hardware
10
No Resources Management (e.g. Workstations and Users) Lack of projects and collections management Manual file handling between the storage server and clients Lack of handling workflow exceptions, dynamic evolution and deviations, except through manual intervention Manual maintenance of the relation with the LIS systems and digital repositories
11
12
Automation
Allows automatic processes without user interactions like; backup, batch image conversions, pdf creation, etc Automates file movements and Storage arrangement
13
Tracking Each Jobs current state, user, workstation and storage location Each Jobs history, including Operator, Machine, Time and date, and Action (Start, Finish, Reject, Redirect) User rates (per book or per page / first time or second time)
14
Management
Defines Projects, Job Types, Phases and manage them Simultaneously Assigns Users to specific Jobs or Projects at specific Phases (operations) Observes the overall backlogs to be able to re-allocate resources at the different Phases/Projects
15
Set Flow path sequence Phases can be added after the system is up and running.
16
Changes the normal flow of phases Downloads and Uploads to fix files Accepts external partially digitized jobs to start at the proper phase within the digitization workflow
Phase X Phase Y Phase Z
Reject
Phase Z
Redirect
17
Extract digitized material Metadata in an Automated way at Job insertion (Check-In) Support the integration with multiple LIS systems at the same time Automatically ingest and update the digitized material into the Repository
18
Hardware Requirements:
Software Requirements:
Installation:
DAFv2 Overview
20
DAF v2.0: Can be tailored to any environment Supports both manual and automated operations Tracks the history of the job's life cycle Is easy to install and configure Is flexible in defining digitization workflow phases Allows for defining different users access level Plug-In based System
21
Handlers
22
New Job
Checkin
Assign
File transfer
Upload
Download
Job assigned to next state Administrator accepts the recommendation Recommend re-do a phase to the job
23
Redirect
System Architecture
24
Phase Manager
Check-In Module
Reporting Module
Administration Module
Archiving Module
Phase CX
Pre Phase CX
Phase CX
Post Phase CX
Phase CN
File Handler
Database Handler
LIS Server
LIS DAF
Off-line Storage
25
System Modules
26
Phase Manager
Check-In Module
Reporting Module
Administration Module
Archiving Module
Phase CX
Pre Phase CX
Phase CX
Post Phase CX
Phase CN
File Handler
Database Handler
LIS Server
LIS DAF
Off-line Storage
27
Check-In
Plug-in based for integration Creates the Job in the system Assign the Job to any Phase
DWMS
aDORe Plug-in . . . .
DSpace
aDORe
28
Phase Manager
Check-In Module
Reporting Module
Administration Module
Archiving Module
Phase CX
Pre Phase CX
Phase CX
Post Phase CX
Phase CN
File Handler
Database Handler
LIS Server
LIS DAF
Off-line Storage
29
Request a new Job Download and upload the Jobs folders and files Submit the Job back to the system to continue other Phases Reject a Job and recommend another Phase in addition to specifying reasons Redirect a Job from the default Phase Sequence Provide information on the files level to help solving problems
30
31
Phase Manager
Check-In Module
Reporting Module
Administration Module
Archiving Module
Phase CX
Pre Phase CX
Phase CX
Post Phase CX
Phase CN
File Handler
Database Handler
LIS Server
LIS DAF
Off-line Storage
32
33
Phase Manager
Check-In Module
Reporting Module
Administration Module
Archiving Module
Phase CX
Pre Phase CX
Phase CX
Post Phase CX
Phase CN
File Handler
Database Handler
LIS Server
LIS DAF
Off-line Storage
34
Reporting
Workflow Tracking Pending Items Late Jobs Operators Rates Build Customized Report
35
Phase Manager
Check-In Module
Reporting Module
Administration Module
Archiving Module
Phase CX
Pre Phase CX
Phase CX
Post Phase CX
Phase CN
File Handler
Database Handler
LIS Server
LIS DAF
Off-line Storage
36
Archiving
On different Medias (CDs, DVDs, Tapes) with different size On online storage Confirm Media Successful reservation
37
Phase Manager
Check-In Module
Reporting Module
Administration Module
Archiving Module
Phase CX
Pre Phase CX
Phase CX
Post Phase CX
Phase CN
File Handler
Database Handler
LIS Server
LIS DAF
Off-line Storage
38
Check-Out
Java Reflection Call section of the XML Phases Definition Ingest the Jobs digital objects into the repository
DWMS
aDORe Plug-in . . . .
DSpace
aDORe
39
Quality Assurance
Maintain QA information on the files levels while moving from a Phase to another A QA Phase is defined in the Digitization Phase Sequence as the last Phase before the Archiving
Information of output objects (pages) level
Arabic Books QA
40
41
42
The defined Phase Sequence for a Job Type is a guide rather than a prescription The list of Phases may or may not being the Phase Sequence. The operator can assign the Job to any of all of these Phases Jobs can be Forwarded dynamically to another Phase in the Phase Sequence Changes in the Phase Sequence affect the current and new Jobs in the system, leading to natural process evolution
Arabic Books Processing Arabic Books OCRing Arabic Books Encoding & Publishing Arabic Books QA Arabic Books Archiving
43
44
Create a Check-In Plug-in to automate the ingestion of AMEEL Books into the System Create a Publishing Reflection Call to: Create Separate text files for each image Rename them according to the original names Move them to the Deliver FTP location on the FTP server
45
46
Thank You
47