Named Entity Recognition.
Extract people, companies, amounts, IBANs and 13+ entity types from any document – powered by 800+ specialized LLMs, context-aware and in real time.
13+ entity types included – people, companies, amounts, IBANs, tax IDs and more. Everything in one analysis.
Included in every extraction
How it works
Upload
Send PDF, scan or image
AI Analysis
800+ LLMs extract entities
Structured Data
JSON with 13+ entity types
Every document is analyzed by the right AI model
Industry-trained language models recognize entities in context – 'Apple' as a company vs. fruit, 'Paris' as a city vs. person.
Zero-Shot Recognition
No training, no templates – ready to use immediately
Multi-Pipeline
LLM + Vision + Pattern matching for maximum coverage
Entity Resolution
Automatic deduplication and canonicalization
Knowledge Graph
Extracted entities form a connected knowledge model
What sets us apart
Context-Aware NER
AI understands meaning in context – not just pattern matching, but real language comprehension.
Zero-Shot Extraction
No training phase, no templates. Every document is understood immediately.
Entity Resolution
'James Wilson', 'J. Wilson' and 'Mr. Wilson' are recognized as the same person.
Knowledge Graph
Entities are automatically connected – see which people are linked to which companies.
Triple-Layer Pipeline
LLM extraction + Vision analysis + Pattern matching – triple security for every entity.
Table Context
Entities from tables are extracted with row and column context – not just isolated values.
139 Languages
Analyze documents in 139 languages – from English and French to Arabic and Chinese.
Real-Time API
Extract entities in under 100ms per page – faster than any human reviewer.
What we extract
From personal names to complex financial data – every information type is recognized.
People
Names, titles, roles, contacts
Organizations
Companies, institutions, departments
Locations & Addresses
Cities, countries, street addresses
Dates & Deadlines
Dates, deadlines, periods
Amounts
Prices, totals, currencies
IBANs & Bank Data
Account numbers, BICs, bank details
Email Addresses
Contact emails from any document
Phone Numbers
Landline, mobile, international formats
Invoice Numbers
Invoice IDs, receipt numbers
Order Numbers
Order IDs, PO numbers
Tax IDs
VAT ID, tax number, TIN
URLs & Links
Web addresses, deep links
Products
Item names, SKUs, services
Regex vs. Standard NLP vs. PaperOffice
Regex / Pattern
- Fixed patterns (IBAN, email)
- Context understanding
- New entity types without code
- Multilingual
- Table extraction
- Entity resolution
Standard NLP
- Standard entities (person, location)
- Context understanding
- Domain-specific types
- 139 languages
- Table extraction
- Entity resolution
PaperOffice AI
- 13+ entity types
- Context-aware LLM recognition
- Zero-shot, no templates
- 139 languages natively
- Tables with row/column context
- Entity resolution & knowledge graph
Entity extraction across all industries
Invoice Processing
Extract vendor, amounts, IBANs, tax IDs automatically for accounts payable automation.
Contract Analysis
Identify parties, dates, obligations, and key terms from legal documents.
Medical Records
Extract patient names, diagnoses, medications, and dosages from clinical documents.
Compliance & KYC
Identify people, companies, addresses for AML/KYC checks and sanctions screening.
Logistics & Supply Chain
Extract tracking numbers, delivery addresses, senders and order numbers from freight documents.
Property Management
Analyze lease agreements: tenants, areas, rent amounts, deadlines captured automatically.
HR & Recruiting
Extract applicants, qualifications, employers and salary expectations from resumes.
Insurance
Capture policy numbers, claim amounts, insured parties and claim dates from reports.
Integrate entity extraction in 5 minutes
# Entity-Extraktion aus Dokument
$ curl -X POST \
"https://api.paperoffice.ai/document_intelligence/entities" \
-H "Authorization: Bearer YOUR_API_KEY" \
-F "file=@invoice.pdf" \
-F "types=PERSON,ORG,MONEY,IBAN,DATE"
# Response: 13+ Entity-Typen mit Confidence
# {"entities":[{"type":"ORG","value":"...","confidence":0.99}]} Your data. Protected.
Named Entity Recognition
What is Named Entity Recognition (NER)?
Which entity types are supported?
How does context-aware NER work?
How accurate is entity extraction?
What is entity resolution?
Which languages are supported?
How do I integrate the NER API?
Is the processing GDPR compliant?
Customer Success Stories
Discover how businesses transformed their processes with PaperOffice AI.
Measurable customer success
"Enterprise document management for all our mining operations. World class."
"FDA-regulated pharmaceutical labels require seamless documentation. PaperOffice has reduced our approval process from days to hours."
"Technical documentation and order processing now run fully automatically."
"Digitization has revolutionized our administration. Citizen inquiries are now processed in minutes instead of days."
"Patient records, medical reports, and referrals are automatically captured and classified. Our practice team finally has time for their patients."
"8,000 employees and thousands of guest documents every day. PaperOffice has completely digitized our back-office processes."
"Loan applications and compliance documents are now processed in minutes instead of hours. The regulatory review is fully automated."
"Construction project documentation and compliance are now digital and traceable."
"Check-in forms, invoices, and guest communications are fully digital. Our concierge team now focuses on providing excellent service."
"Gas station billing, supplier documents, and compliance records from over 250 stations are automatically processed and archived."
"GMP-compliant documentation for pharmaceutical functional labels is now fully automated. Audit trails are seamless and instantly accessible."
"Blueprints, permits, and customer files for our wooden houses are now managed centrally. After six generations, we are finally paperless."
"Supplier contracts, certificates, and customs papers for hundreds of roasteries worldwide are automatically captured and assigned to the correct product."
"Temperature logs, CMR waybills, and food certificates are automatically scanned and assigned to the order. Misallocations are a thing of the past."
"Heavy transport permits, route plans, and project documentation are now instantly accessible digitally. No more searching through folders."
"Customs documents, warehouse receipts, and shipping orders for our Eastern European network are automatically classified. Four generations of logistics, finally digital."
"Printing specifications, customer approvals, and material certificates are now accessible centrally. The production error rate has dropped to near zero."
"Sustainability certificates, supplier contracts, and customer specifications are processed automatically. Swiss precision, now also digital."
"Building permits, subcontractor contracts, and acceptance protocols for our commercial properties are now fully documented digitally."
"Purchase agreements, exposés, and customer files from over 30 years of market leadership are now digitally searchable. Every agent finds everything in seconds."
"Ocean freight documents, customs declarations, and bills of lading are automatically captured. Baltic Sea logistics has never been so efficient."
"Thousands of custom packaging orders per week, including design approvals, print data, and delivery documents. PaperOffice keeps it all together."
"Pathology lab documentation and device certifications are now fully automated. Seamless traceability for every specimen."
"Loan documents, security papers, and customer correspondence are processed in minutes instead of hours. The regulatory review is seamless."
"Technical specifications, quality certificates, and recycling protocols for our steel production are central and instantly accessible."
"Pharma and cosmetic label specifications with regulatory requirements are automatically checked and approved. No more manual checklists."
"KYC documents, investment reports, and regulatory filings are classified in seconds. As an MAS-regulated robo-advisor, seamless compliance is essential."
"Safety data sheets, transport permits, and ADR documents for chemical logistics are automatically assigned. Zero tolerance for errors."
"Certificates of authenticity, supplier records, and customer warranties for our pearl collections are automatically archived and instantly accessible."
"SME loan applications, security documents, and regulatory reports are automatically classified. Our advisors have more time for customer consulting."
"Customer files from over 135 years of banking history are being successively digitized. 5,000 employees now have instant access to all documents."
"Microcredit applications and compliance documents for millions of customers are now processed in minutes instead of days. A game changer for financial inclusion."
"Hundreds of thousands of delivery notes and return slips per day are processed automatically. Vietnam's leading e-commerce logistics provider, now paperless."
"CNC manufacturing protocols, material certificates, and customer specifications for oil, gas, and aerospace projects are now fully documented digitally."
"Vision 2030 requires complete digitization. PaperOffice processes government documents for Saudi Arabia's digital backbone."
"Material certificates, hardening protocols, and customer specifications are automatically assigned to the correct order. Australia's only Q&T manufacturer, now paperless."
"Weld seam protocols, structural calculations, and project plans are managed digitally. Our workshop teams have access in real time."
"Aerospace certificates, CNC programs, and customer tolerances are automatically classified. AS9100 compliance has never been easier."
"Merchant contracts, KYC documents, and transaction receipts for hundreds of thousands of SMEs are processed in seconds. Mexico's payment revolution, paperless."
"Harvest documentation, export certificates, and quality protocols for our high-altitude Malbecs are automatically archived. 120 years of winemaking tradition, now digital."
"Millions of user verifications and regulatory documents are processed fully automatically. Scaling without paper."
"Merchant onboarding documents and compliance records for Africa's leading payment provider are processed in minutes instead of days."
"Organic certificates, supplier audits, and product labels for thousands of natural products are automatically checked and archived."
"Regulatory documents from 33 African countries, partner contracts, and audit trails are managed fully automatically. An enterprise DMS for a $3 billion fintech."
"Prescriptions and medication management now run fully automatically. More time for our patients."
"Centuries-old documents are now digitally searchable. A milestone for our historical archives."
"Our caregivers finally have more time for residents instead of paperwork."
"500,000+ records digitized. Our deputies now find all information instantly."
"Patient records management is now a breeze. Everything automatically captured and archived."
"Inheritance and estate documentation is now efficient and error-free."
"Digital property management for all our residential complexes. Tenants and owners are thrilled."
"With PaperOffice, we have accelerated our invoice processing by 99%. The AI automatically recognizes all relevant data and assigns it correctly."
"PaperOffice has become indispensable to our daily work. Orders, quotes, and invoices are now automated."
"The documentation of our care services is now digital and automated. More time for our patients."
"Technical drawings, bills of materials, and quality protocols are instantly searchable. A search that used to take 30 minutes now takes 30 seconds."
"Client receipts, tax assessments, and annual financial statements are automatically sorted and assigned to the correct client. Receipt chaos is a thing of the past."
"Project documentation, SLAs, and customer communication for our IT consulting projects are automatically classified and archived."
Entity extraction in seconds
Extract people, companies, amounts and 13+ entity types from any document – fully automated.