Multi-Engine AI Technology for Intelligent Document Processing
Every extracted data point is verified in real time against your management system's master data. The system learns from your confirmations and continuously improves.
What is Intelligent Document Processing (IDP) and why choose Italian IDP solutions
Data Alchemy's Intelligent Document Processing (IDP) is an advanced software solution that automates the extraction and registration of data from business documents such as invoices, delivery notes, orders and contracts. Unlike traditional OCR software, our multi-engine AI technology combines 5 artificial intelligence models to guarantee 99.8% accuracy on any document type.
Why choose IDP software over traditional OCR?
- ✓ Contextual understanding: AI interprets the meaning of the data rather than just reading it
- ✓ Automatic validation: every field is verified against ERP master data in real time
- ✓ Continuous learning: the system improves with every processed document
- ✓ Zero configuration: no templates to create, works immediately on any format
IDP Solutions for Italian companies
Data Alchemy's IDP (Intelligent Document Processing) solutions address the specific needs of Italian SMEs: handling of XML FatturaPA electronic invoices, paper and digital delivery notes (DDT), customer orders via email and commercial contracts. Unlike generic OCR software, our platform understands the Italian document context, automatically recognises VAT numbers, fiscal codes, SDI recipient codes and VAT rates, and integrates natively with the major ERP systems used in the Italian market such as SAP Business One, Zucchetti Ad Hoc, TeamSystem Enterprise and Mago.net.
Real-time verification
Every field is cross-checked against ERP/CRM master data: suppliers, items, prices, conditions. Anomalies are flagged before posting.
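A minimal sketch of this kind of cross-check, for illustration only. The in-memory tables, the field names (supplier_vat, item_code, unit_price) and the price tolerance are assumptions made for the example, not part of the Data Alchemy product; a real deployment would query the ERP.

```python
from dataclasses import dataclass

# Hypothetical master data; a real deployment would read this from the ERP.
SUPPLIERS = {"12345678903": "Rossi Srl"}
PRICE_LIST = {"ART-001": 12.50}


@dataclass
class InvoiceLine:
    supplier_vat: str
    item_code: str
    unit_price: float


def find_anomalies(line: InvoiceLine, tolerance: float = 0.01) -> list[str]:
    """Return readable anomalies; an empty list means nothing was flagged."""
    anomalies = []
    if line.supplier_vat not in SUPPLIERS:
        anomalies.append(f"unknown supplier {line.supplier_vat}")
    expected = PRICE_LIST.get(line.item_code)
    if expected is None:
        anomalies.append(f"unknown item {line.item_code}")
    elif abs(line.unit_price - expected) > tolerance:
        anomalies.append(
            f"price {line.unit_price} differs from master price {expected}"
        )
    return anomalies
```

A line whose list of anomalies is empty can go straight to posting; anything else is held back for review.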
Automatic enrichment
Missing data in the document is automatically completed by pulling from master data: item codes, ledger accounts, cost centers.
Learning after 3 matches
After the same operator has confirmed a match 3 times, the system suggests it automatically on future documents. Less human intervention, more speed.
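One way to picture the 3-confirmation rule is a simple counter. This is a sketch under assumptions: the MatchMemory class, its method names and the choice to count confirmations per operator are hypothetical, and the actual learning logic may work differently.

```python
from collections import Counter

CONFIRMATIONS_REQUIRED = 3  # the threshold stated in the text


class MatchMemory:
    """Hypothetical store of operator-confirmed matches."""

    def __init__(self):
        # (operator, extracted text, master-data code) -> confirmation count
        self._counts = Counter()

    def confirm(self, operator: str, extracted: str, code: str) -> None:
        self._counts[(operator, extracted, code)] += 1

    def suggest(self, operator: str, extracted: str):
        """Return a code this operator confirmed 3 or more times, else None."""
        for (op, text, code), count in self._counts.items():
            if (op == operator and text == extracted
                    and count >= CONFIRMATIONS_REQUIRED):
                return code
        return None
```

For example, after an operator links the description "Vite M6x20" to a hypothetical item code "ART-001" three times, suggest() returns "ART-001" for that operator; after two confirmations it still returns None.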
Exception handling
Documents with uncertain data are routed to the appropriate operator with all data pre-filled. One click to confirm or correct.
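A sketch of how such routing could look, assuming each extracted field carries a confidence score between 0 and 1. The 0.90 threshold, the queue names and the route_document function are illustrative assumptions, not documented product settings.

```python
REVIEW_THRESHOLD = 0.90  # hypothetical cut-off, not a documented setting


def route_document(
    fields: dict[str, tuple[str, float]]
) -> tuple[str, list[str]]:
    """Take field name -> (value, confidence); return (queue, fields to review)."""
    uncertain = [
        name
        for name, (_value, confidence) in fields.items()
        if confidence < REVIEW_THRESHOLD
    ]
    queue = "operator_review" if uncertain else "auto_post"
    return queue, uncertain
```

Every value stays pre-filled in both cases; the operator only needs to look at the fields returned in the second element.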
Next-generation AI technology
An intelligent orchestrator selects the best AI model for each document type, ensuring maximum accuracy in every situation.
Data Alchemy's NLP engines: how Claude, GPT, Gemma, DeepSeek and CDBKR AI work together
At the heart of Data Alchemy's IDP software is a multi-engine natural language processing (NLP) architecture that combines the most advanced Large Language Models available on the market with a proprietary orchestrator. Instead of relying on a single model — with its specific weaknesses on certain layouts, languages or document types — the system automatically selects, page by page, the best-performing AI engine for that context. The result is 99.8% end-to-end accuracy on invoices, delivery notes, customer orders, contracts and price lists in real-world conditions.
Claude (Anthropic): contextual natural-language understanding
Claude by Anthropic is the NLP engine responsible for contextual understanding of long and nuanced text: commercial contracts, payment conditions, non-standard clauses and product descriptions. Its ability to interpret the semantic meaning of a document — not just recognise its characters — makes it ideal for extracting information from complex invoices with marginal notes, long body text, or cross-references to other documents.
GPT (OpenAI): structured data extraction from tables and forms
GPT by OpenAI is optimised for structured data extraction from tables, forms and documents with repetitive layouts. The model excels at identifying fields such as amounts, item codes, VAT rates, due dates and order references, producing JSON output that can be consumed directly by ERP systems. It is the primary engine for FatturaPA electronic invoices and order forms.
Gemma (Google): multi-language support for international suppliers
Gemma by Google is the NLP model dedicated to multi-language documents from international suppliers. In addition to Italian, it correctly processes invoices and orders in English, German, French, Spanish and other European languages, recognising local conventions on dates, currencies, decimal separators and tax formats. It is essential for Italian companies that import raw materials or work with foreign partners.
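To make "local conventions on decimal separators" concrete, here is a small sketch that normalises an amount once the document language is known. The SEPARATORS table and the parse_amount function are assumptions made for the example; real documents vary more (French invoices, for instance, often use a non-breaking space as the thousands separator).

```python
from decimal import Decimal

# Hypothetical per-language conventions: (thousands separator, decimal separator)
SEPARATORS = {
    "it": (".", ","),
    "de": (".", ","),
    "es": (".", ","),
    "fr": (" ", ","),
    "en": (",", "."),
}


def parse_amount(text: str, language: str) -> Decimal:
    """Convert a locale-formatted amount such as '1.234,56' to a Decimal."""
    thousands, decimal_separator = SEPARATORS[language]
    cleaned = text.strip().replace(thousands, "").replace(decimal_separator, ".")
    return Decimal(cleaned)
```

With this table, "1.234,56" on an Italian invoice and "1,234.56" on an English one both become Decimal("1234.56").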
DeepSeek: complex tables and multi-column delivery notes
DeepSeek specialises in processing complex tables and documents with heavily structured layouts, such as multi-column delivery notes (DDT), packing lists with per-lot groupings, or bills of materials with nested hierarchies. Its tabular structure analysis algorithm preserves row-column correspondence even in the presence of merged cells, multi-level headers, or scanned formats of uneven quality.
CDBKR AI (proprietary): the orchestrator that coordinates every engine
CDBKR AI is our proprietary engine developed in-house: it does not process documents directly but acts as an orchestrator. It analyses each incoming page, classifies its type, language and quality, and routes it to the most suitable NLP engine. It then compares the outputs of the different models, resolves conflicts and applies business rules specific to the Italian market (VAT number, fiscal code, SDI recipient code, VAT rates). It is the intelligence layer that turns five AI models into a single, coherent and reliable platform.
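The three steps described above (route, compare, apply business rules) can be sketched as follows. This is not the CDBKR AI implementation: pick_engine, resolve_field and the routing table are hypothetical and only loosely follow the engine descriptions on this page, and majority voting is just one possible way to resolve conflicts. The VAT number check uses the standard check-digit rule for the 11-digit Italian partita IVA.

```python
from collections import Counter
from typing import Optional

# Hypothetical routing table, loosely based on the engine descriptions above.
ENGINE_BY_TYPE = {
    "contract": "claude",
    "invoice": "gpt",
    "order": "gpt",
    "delivery_note": "deepseek",
}


def pick_engine(doc_type: str, language: str) -> str:
    """Send non-Italian pages to the multi-language engine, others by type."""
    if language != "it":
        return "gemma"
    return ENGINE_BY_TYPE.get(doc_type, "claude")


def resolve_field(values_by_engine: dict[str, str]) -> Optional[str]:
    """Keep a value only if a strict majority of engines agree on it."""
    if not values_by_engine:
        return None
    value, votes = Counter(values_by_engine.values()).most_common(1)[0]
    return value if votes * 2 > len(values_by_engine) else None


def is_valid_partita_iva(vat: str) -> bool:
    """Validate the check digit of an 11-digit Italian VAT number."""
    if len(vat) != 11 or not (vat.isascii() and vat.isdigit()):
        return False
    total = 0
    for index, char in enumerate(vat[:10]):
        digit = int(char)
        if index % 2 == 1:  # 2nd, 4th, ... digit: double, subtract 9 if > 9
            digit *= 2
            if digit > 9:
                digit -= 9
        total += digit
    return (10 - total % 10) % 10 == int(vat[10])
```

When resolve_field returns None the engines disagree, which is a natural point to hand the field to an operator instead of guessing.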
REST API, webhooks and SQL: native integration with SAP, Zucchetti and TeamSystem
Data Alchemy's IDP platform integrates with any ERP or management system via REST API, webhooks and SQL queries — with native connectors for SAP, Zucchetti and TeamSystem and full support for the Italian electronic invoicing standard (FatturaPA). No vendor lock-in, maximum flexibility, ready in days, not months.
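As an illustration of the webhook-plus-SQL route, the sketch below takes a JSON webhook body and writes it to a staging table. Everything specific here is an assumption: the payload fields (document_id, supplier_vat, total), the staged_invoices table and the handle_webhook function are hypothetical, and SQLite stands in for the ERP database.

```python
import json
import sqlite3


def create_staging_table(conn: sqlite3.Connection) -> None:
    conn.execute(
        "CREATE TABLE IF NOT EXISTS staged_invoices ("
        "document_id TEXT PRIMARY KEY, "
        "supplier_vat TEXT NOT NULL, "
        "total TEXT NOT NULL)"
    )


def handle_webhook(conn: sqlite3.Connection, body: bytes) -> str:
    """Store one extracted document; re-delivery of the same id overwrites it."""
    payload = json.loads(body)  # hypothetical payload shape
    with conn:  # commits on success, rolls back on error
        conn.execute(
            "INSERT OR REPLACE INTO staged_invoices VALUES (?, ?, ?)",
            (payload["document_id"], payload["supplier_vat"], payload["total"]),
        )
    return payload["document_id"]
```

Keying the table on the document id makes the handler safe to call twice for the same delivery, which matters because webhook senders commonly retry.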
Integration Methods
Three connection modes to adapt to any existing IT infrastructure.
Output Formats
Extracted data is returned in the format best suited to your workflow.
Compatible ERPs
Native integration with the ERPs most widely used by businesses.
Email Integration
Connect a Google Workspace or Microsoft 365 mailbox. Data Alchemy monitors it automatically, filters spam and irrelevant messages, and processes only documents matching the assigned model.
Security and Access
Granular permission management and compliance with European regulations.
Full ERP/CRM integration
Connect your mailbox and automate document acquisition
Link a Google Workspace or Microsoft 365 mailbox to a document model. Data Alchemy monitors incoming mail, automatically recognises relevant documents, and discards spam and irrelevant messages — with zero manual intervention.
How it works
- 01 Set up the mailbox: connect a dedicated address (e.g. logistics@data-alchemy.it) to a specific document model such as delivery notes, invoices, or orders
- 02 Continuous monitoring: Data Alchemy checks the mailbox in real time and analyses every incoming message with AI
- 03 Smart filtering: spam, newsletters, and emails unrelated to the model are automatically ignored
- 04 Automatic processing: relevant documents (PDF attachments, images, structured emails) are extracted and processed just like any manually uploaded document
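The filtering and extraction steps can be sketched with Python's standard email module. Note the substitution: simple header and subject-keyword rules stand in for the AI classifier described above. The relevant_attachments function, the keyword list and the accepted file types are assumptions for the example.

```python
from email import message_from_bytes, policy

ACCEPTED_TYPES = {"application/pdf", "image/png", "image/jpeg"}


def relevant_attachments(
    raw: bytes, keywords: tuple[str, ...] = ("ddt", "delivery note")
) -> list[tuple[str, bytes]]:
    """Return (filename, content) for attachments worth processing."""
    message = message_from_bytes(raw, policy=policy.default)
    # Stand-in for smart filtering: newsletters usually carry this header.
    if message["List-Unsubscribe"] is not None:
        return []
    subject = (message["Subject"] or "").lower()
    if not any(keyword in subject for keyword in keywords):
        return []
    return [
        (part.get_filename() or "attachment", part.get_content())
        for part in message.iter_attachments()
        if part.get_content_type() in ACCEPTED_TYPES
    ]
```

Messages that return an empty list are simply ignored; anything else goes into the same processing path as a manually uploaded document.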
Practical example
You connect logistics@data-alchemy.it to the delivery note model. A supplier sends a delivery note PDF by email: Data Alchemy detects it, processes it, and enters the data into your ERP in 3 seconds. If a newsletter or marketing email lands in the same mailbox, it is simply ignored.
Google Workspace
Native integration with Gmail and Google Workspace. Simply authorise mailbox access.
Microsoft 365
Full compatibility with Outlook and Microsoft 365 via secure protocol.
AI spam filter
AI distinguishes business documents from spam and emails unrelated to the configured model.
Multi-mailbox
Connect multiple mailboxes to different models: one for delivery notes, one for invoices, one for orders.