Technology: What’s under the hood?
How we’ve trained our AI to fully understand documents
Breakthrough computer vision and natural language processing for document understanding
Pre-trained on 55 million industrial documents, including invoices, receipts, packing lists, shipping labels, and forms
The pre-trained AI requires no additional training when deployed. Deployment can take as little as one day.
Supercharge your document workflows today
Photon provides a universal data capture service, with breakthrough deep learning, image classification, object recognition, image processing, and natural language processing algorithms under the hood.
Photon's proprietary machine learning models have been trained on the largest datasets of industrial documents in the world.
Its models get smarter over time. Photon returns structured data from any image, document, or scan by understanding the document's type, spatial layout, and format, and the data type of each field.
Computer vision pipeline
Image quality triaging
Automatic thresholding
Boundary detection
Cropping regions of interest
Document classification
Structured layout comprehension
Object detection
Barcode scanning
Image transformations, auto-rotations
De-skew, affine transformations
Printed vs handwritten text classification
Zonal analysis
Cascaded, semi-supervised deep learning model
Pooling
Ensembling
Text capture
Signature validation
Cascaded case escalation
Structured text and barcode data output
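To make one of the steps above concrete, here is a minimal sketch of automatic thresholding using Otsu's method, a standard technique for separating dark ink from a light page. Photon's actual thresholding algorithm is not described on this page, so the choice of Otsu's method and the function names `otsu_threshold` and `binarize` are assumptions made for illustration only.

```python
from typing import List, Sequence


def otsu_threshold(pixels: Sequence[int], levels: int = 256) -> int:
    """Pick the gray level that best separates two pixel classes.

    Illustrative only: Otsu's method is assumed here, not confirmed
    as the algorithm Photon uses.
    Pixels <= threshold form one class, pixels > threshold the other.
    """
    # Histogram of gray levels.
    hist = [0] * levels
    for p in pixels:
        hist[p] += 1

    total = len(pixels)
    sum_all = sum(level * count for level, count in enumerate(hist))

    best_t, best_var = 0, -1.0
    w0 = 0    # number of pixels in the lower class
    sum0 = 0  # sum of gray levels in the lower class
    for t in range(levels):
        w0 += hist[t]
        sum0 += t * hist[t]
        if w0 == 0:
            continue          # lower class still empty
        w1 = total - w0
        if w1 == 0:
            break             # upper class empty from here on
        mean0 = sum0 / w0
        mean1 = (sum_all - sum0) / w1
        # Between-class variance (up to a constant factor).
        var = w0 * w1 * (mean0 - mean1) ** 2
        if var > best_var:
            best_var, best_t = var, t
    return best_t


def binarize(pixels: Sequence[int], threshold: int) -> List[int]:
    """Map each pixel to 1 (above threshold) or 0 (at or below)."""
    return [1 if p > threshold else 0 for p in pixels]


if __name__ == "__main__":
    # A toy "scan": half dark ink pixels, half bright paper pixels.
    scan = [10] * 50 + [200] * 50
    t = otsu_threshold(scan)
    print(t, sum(binarize(scan, t)))
```

In a real pipeline this step would typically run on a grayscale image array after quality triaging, and its output would feed boundary detection and cropping.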
Natural language processing pipeline
Constituency parsing
Semantic parsing
Bidirectional Recurrent Neural Networks (LSTMs)
Word vector space model similarity matching
Structured field matching
Data type validation
Domain-specific dictionary lookups
Spell checking
Selective case escalation
Relation extraction
Name matching
Address validation
Tracking # lookups
UPC, SKU, item lookups
Invoice, PO, receivables, ASN matching
Database queries
Exception handling and reconciliation
Data type transformations; extract, transform, load (ETL)
API integrations with WMS, ERP, IMS, DB
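Two of the checks listed above, UPC validation and name matching, can be sketched in a few lines. This is a hedged illustration, not Photon's implementation: the function names `is_valid_upc_a` and `match_name`, the 0.8 similarity cutoff, and the sample vendor names are all hypothetical. The check-digit rule is the standard UPC-A one, and the fuzzy matching uses Python's standard `difflib` module.

```python
import difflib
from typing import Optional, Sequence


def is_valid_upc_a(code: str) -> bool:
    """Validate a 12-digit UPC-A code using its check digit."""
    if len(code) != 12 or not (code.isascii() and code.isdigit()):
        return False
    digits = [int(c) for c in code]
    odd_sum = sum(digits[0:11:2])   # 1st, 3rd, ..., 11th digits
    even_sum = sum(digits[1:10:2])  # 2nd, 4th, ..., 10th digits
    check = (10 - (3 * odd_sum + even_sum) % 10) % 10
    return check == digits[11]


def match_name(
    captured: str,
    known_names: Sequence[str],
    cutoff: float = 0.8,  # hypothetical similarity threshold
) -> Optional[str]:
    """Return the closest known name, or None if nothing is similar enough.

    Useful when text capture introduces small errors such as a dropped letter.
    """
    matches = difflib.get_close_matches(captured, known_names, n=1, cutoff=cutoff)
    return matches[0] if matches else None


if __name__ == "__main__":
    print(is_valid_upc_a("036000291452"))
    # Hypothetical vendor list standing in for an enterprise database.
    vendors = ["Acme Logistics", "Apex Freight"]
    print(match_name("Acme Logistcs", vendors))
```

A captured value that fails either check would be a natural candidate for the selective case escalation step rather than being passed downstream unverified.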
The Intelligent Document Processing solution for modern businesses
Pre-extraction:
Performs image pre-processing to improve the quality of the scanned document, captures data, and indexes and classifies documents into categories
Extraction:
Captures relevant data using natural language processing for further processing
Post-extraction:
Validates the extracted data with the help of business logic, validation rules, and enterprise databases
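The three stages above can be sketched as a small pipeline. This is a simplified illustration under stated assumptions: plain text stands in for the output of image pre-processing and text capture, keyword and regular-expression rules stand in for the trained models, and every name here (`Document`, `pre_extract`, `extract`, `post_extract`, `known_invoices`) is hypothetical rather than part of Photon's API.

```python
import re
from dataclasses import dataclass, field
from typing import Dict, List, Set


@dataclass
class Document:
    text: str                      # stands in for captured text from a scan
    category: str = "unknown"
    fields: Dict[str, str] = field(default_factory=dict)
    errors: List[str] = field(default_factory=list)


def pre_extract(doc: Document) -> Document:
    """Pre-extraction: clean up the input and classify the document."""
    doc.text = " ".join(doc.text.split())  # normalize whitespace
    lowered = doc.text.lower()
    if "invoice" in lowered:
        doc.category = "invoice"
    elif "packing list" in lowered:
        doc.category = "packing_list"
    return doc


def extract(doc: Document) -> Document:
    """Extraction: capture the relevant fields (regex rules as a stand-in)."""
    number = re.search(
        r"invoice\s*(?:no\.?|#)\s*:?\s*([A-Z0-9-]+)", doc.text, re.IGNORECASE
    )
    if number:
        doc.fields["invoice_number"] = number.group(1)
    total = re.search(
        r"total\s*:?\s*\$?\s*([0-9]+(?:\.[0-9]{2})?)", doc.text, re.IGNORECASE
    )
    if total:
        doc.fields["total"] = total.group(1)
    return doc


def post_extract(doc: Document, known_invoices: Set[str]) -> Document:
    """Post-extraction: apply validation rules and a database lookup."""
    number = doc.fields.get("invoice_number")
    if number is None:
        doc.errors.append("missing invoice number")
    elif number not in known_invoices:  # set stands in for an enterprise database
        doc.errors.append(f"unknown invoice number: {number}")
    total = doc.fields.get("total")
    if total is None or float(total) <= 0:
        doc.errors.append("missing or non-positive total")
    return doc


if __name__ == "__main__":
    raw = Document("INVOICE\nInvoice No: INV-1042\nTotal: $318.50")
    result = post_extract(extract(pre_extract(raw)), known_invoices={"INV-1042"})
    print(result.category, result.fields, result.errors)
```

Keeping each stage as a separate function mirrors the split described above: a document that accumulates entries in `errors` can be routed to exception handling without rerunning the earlier stages.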