Insurance documents — EOBs, authorizations, claims, and remittance advices — arrive in different formats from every payer. A practice working with Blue Cross, Cigna, Aetna, UnitedHealthcare, and Medicare receives the same data in five different layouts. AI-first extraction tools like Lido handle this variation without per-payer templates, processing any payer’s documents on first upload. Relay, a healthcare claims processor, processed 16,000 Medicaid claims across dozens of managed care organization formats in five days without building a single template.
Every insurance document contains the same core data: payment amounts, adjustment codes, patient responsibility, denial reasons, approved units, and date ranges. The problem is that every payer presents this data differently. A Blue Cross EOB looks nothing like an Aetna EOB, which looks nothing like a Medicare remittance advice. The field labels change, the positions shift, the structures vary — and yet the underlying information your billing system needs is identical.
For a practice with 15 or more payer contracts, this means 15 or more document formats to handle. For billing companies processing on behalf of multiple practices, the numbers compound fast: hundreds of payer-format combinations flowing through the same office. One medical lab owner put it simply. He processes EOBs from Blue Cross and Cigna, calling them “same basic use cases, same thing, different formats.” He needed one system that handled both without building separate configurations for each payer.
Libertana, a Los Angeles home health agency, hits this problem at the authorization level. They receive insurance authorizations from Health Net, LA Care, CalOptima, and Anthem. Each payer sends authorizations in a different format with different field names and layouts, but all contain the same core data: approved units, coverage tiers, date ranges, and reimbursement rates. Processing these manually means learning four different document layouts for what is the same information.
Explanation of Benefits (EOBs) are the most variable insurance document type. Each EOB contains payment detail, adjustment codes (CARC and RARC), denial reasons, and patient responsibility amounts — but every payer arranges this information differently. A practice processing EOBs from 20 payers receives 20 different formats. Some list line items in tables, others use nested sections, and others spread the same data across multiple pages. This is why automated EOB extraction is one of the most common uses of AI document processing in healthcare.
Prior authorizations specify approved services, units, date ranges, and coverage tiers. Libertana works with five standardized authorization tiers at a $33/unit rate, but receives authorization documents in payer-specific formats from each insurer. The data they need — how many units were approved and for what date range — is consistent across payers. The documents that deliver that data are not.
Claims acknowledgments are acceptance or rejection notifications that vary by payer and clearinghouse. They are not as variable as EOBs, but still inconsistent enough to require format-aware processing. A rejection from one payer may use different status codes, formatting, and error descriptions than a rejection from another.
ERA versus paper EOB is a distinction worth calling out. The ANSI 835 standard gives Electronic Remittance Advices (ERAs) a consistent electronic format, but paper EOBs have no such standard. Many practices receive both — electronic for some payers, paper for others — creating a dual-format processing challenge. The remittance advice extraction workflow must handle both channels and produce the same structured output regardless of source.
Certificates of Insurance (COIs) are a separate problem for insurance companies themselves. Gallagher (AJG), a Fortune 500 insurance broker, processes COIs where coverage fields must use exact wording matching a requirements table. A single certificate produces three different data imports loading into separate database tables. The payer-specific formatting of coverage descriptions — where one insurer writes “Commercial General Liability” and another writes “CGL Coverage” — makes extraction harder, and the problem multiplies across hundreds of certificates.
The traditional approach to multi-payer document processing is to build a template for each payer: one for Blue Cross, another for Aetna, another for Cigna, another for UnitedHealthcare, another for Medicare, and so on. Each template maps specific coordinates or field positions to output columns. This is the core problem with template-based extraction. This works until it doesn’t, and it stops working fast. Payers update their formats routinely, and each update breaks the corresponding template without warning. A single field relocation on an Aetna EOB silently breaks extraction for every Aetna document until someone notices the data is wrong and rebuilds the template.
Billing companies face the worst version of this problem. Fifty practices multiplied by 20 payers creates hundreds of potential format variations. Maintaining templates for each combination is a full-time job that produces no revenue. The training-based alternative is not better: tools like Nanonets require 50 to 200 sample documents per document type to train a model. For 20 payers across 3 document types, that means 60 separate models to train and maintain — each one vulnerable to the same format-change breakage that plagues templates.
A gas distribution company called Esprigas went through all three approaches. They started with Docparser, where templates broke when document formats changed. They moved to Nanonets, which required constant retraining as new formats appeared. They ultimately moved to Lido, which required no templates and no training. This multi-tool migration pattern — templates, then training, then AI — is common among teams that tried the template and training approaches first and found they could not keep up with format variation.
The AI-first approach inverts the traditional workflow. Instead of building a template for each payer, you define your output columns once — matched to your billing system’s import format — and let the AI map each payer’s field names to those columns using contextual understanding. “Amount Paid” on a Blue Cross EOB, “Payment” on an Aetna EOB, and “Amt Pd” on a Medicare remittance advice all map to your Payment column automatically. No new configuration is needed when you add a payer or when a payer updates their format.
The results at scale are concrete. Relay processed documents from dozens of Medicaid managed care organization formats — zero templates, 16,000 claims processed in five days. US Neurology processes EOBs and invoices across 8 separate entities with multiple payers, consolidating everything into unified reporting. Neither organization built per-payer configurations. They defined what they needed once and let the AI handle format variation.
This is the same principle that works for multi-vendor invoice processing — just applied to the insurance domain, where format variation is even more extreme. An accounts payable team dealing with 200 vendor invoice formats faces the same structural problem as a billing company dealing with 20 payer EOB formats. The solution is identical: define output columns once, let the AI handle the mapping.
Format variation is only part of the challenge. The meaning of extracted data also varies by context. Take adjustment code interpretation: CO-18 (duplicate claim or service) requires different follow-up than CO-29 (timely filing limit expired) or PR-1 (patient deductible). A single extraction configuration can apply conditional logic — summing all line items by denial code category, separating paid amounts from denied amounts, and flagging specific codes for resubmission — without building separate rules for each payer.
Then there are payer-specific fields. Some payers include check numbers on their EOBs while others do not. Some use EFT reference numbers. Some include the rendering provider’s NPI while others omit it. A well-designed extraction template handles all of these variations with optional columns — populated when present, left blank when not — rather than requiring a different template for each payer’s field set.
Account number formatting is trickier than it looks. Some practices use account numbers that start with leading zeros — formats like 00-XXXX or 000-XXXX. These must be preserved as text, not treated as numbers that would strip the zeros. One lab owner specifically called this out as a problem: the AI was “totally ignoring” the instruction to preserve leading zeros until the extraction template was corrected with explicit formatting instructions. Special instructions like these apply across all payers from a single configuration. You fix it once, not once per payer template. For detailed guidance on setting up denial code routing and adjustment reason extraction, see how to extract data from EOBs automatically.
With AI-first extraction, you batch all payers together. No pre-sorting required. Upload a mixed stack of EOBs from Blue Cross, Aetna, Cigna, UnitedHealthcare, Humana, Medicare, and Medicaid — Lido handles all of them in a single run. There is no manual classification step. Lido reads each document and extracts based on content, not based on which payer sent it or which folder it was sorted into.
Volume benchmarks from real deployments show what this looks like in production. Relay’s 16,000-claim Medicaid workload, which had previously taken months, completed in five days using Lido and saved over 100 hours of manual work per week. US Neurology processes over 175,000 PDFs per year across 8 entities with multiple payers. Paper Alternative handles 6,000 CMS 1500 forms per month across payer variations and is scaling to 10,000 or more. None of these operations pre-sort documents by payer before processing.
Email automation removes the upload step. Forward EOBs from any payer to a dedicated inbox address, and Lido processes documents automatically as they arrive. A billing company receiving EOBs from 30 payers via email can route all of them to the same inbox. The AI reads each document, extracts the relevant fields, and delivers structured data regardless of which payer sent the original.
The output format is consistent regardless of the source payer. Your billing system import sees one standardized format whether the original document was a Blue Cross EOB, a Cigna remittance, or a Medicare summary notice. That is the goal: absorb format variation at the extraction layer so every downstream system receives clean, uniform data.
Multi-payer workflows often require cross-referencing patient identifiers. By uploading a context document — your patient master list or account roster — the AI matches extracted patient names and member IDs to your internal identifiers. This is similar to vendor master matching in accounts receivable workflows, where an uploaded vendor list maps varying vendor names on invoices to standardized internal codes.
Downstream workflows vary by practice. Some import extracted data directly via CSV into their practice management system. Others convert structured output to PDF for claim resubmission. Others push data via API to systems like AdvancedMD, Kareo, or athenahealth. One lab owner described his complete pipeline: Lido extracts data from mixed-payer EOBs into Excel, which converts to CSV, which feeds into a bulk PDF generation tool that creates 1,000 pages in 90 seconds, which then uploads to his billing system for claim resubmission. The entire pipeline runs on a single extraction configuration that handles every payer he works with.
Start by defining your output columns to match your billing system’s import format. Use your system’s exact column names (Patient_Name, CPT_Code, Billed_Amount, Adjustment_Code, etc.) so the output can be imported without reformatting. Next, upload a mixed test batch with EOBs from your top three to five payers by volume. Include at least one commercial payer (Blue Cross, Aetna, or Cigna), one Medicare RA, and one Medicaid MCO if applicable. Review the output, add special instructions for edge cases (leading zero preservation, denial code routing, optional fields), then reprocess with Lido’s free 24-hour reprocessing until the output matches your requirements. Once the template works on your test payers, it handles new payers automatically with no additional configuration.
Lido processes insurance documents — EOBs, authorizations, claims, and remittance advices — alongside invoices, purchase orders, bank statements, and other document types. The same AI that handles multi-payer format variation in healthcare also handles multi-vendor variation in accounts payable and multi-format variation in logistics. To see how document automation applies to your specific payer mix, see the complete guide to OCR for healthcare and medical practices.
No. You can upload a mixed batch of EOBs, authorizations, and claims from any combination of payers, and the system processes all formats in a single run. There is no pre-classification or sorting step required. The AI reads each document individually and extracts based on content, not based on payer identity or document format.
Yes. Paper EOBs are processed via AI extraction, which reads the document visually and maps fields to your output columns. Electronic 835 ERAs follow the ANSI standard format and are parsed accordingly. Both produce the same structured output, so your billing system receives consistent data regardless of whether the original was a scanned paper EOB or an electronic remittance file.
Nothing — no reconfiguration is needed. AI-first extraction reads documents contextually, understanding what each field represents based on its content and surrounding context rather than matching fixed coordinates or positions. When a payer moves a field, changes a label, or restructures their EOB layout, the extraction continues to work because the AI adapts to the new format automatically.
On clean digital PDFs, Lido achieves 99.5 to 100 percent field-level accuracy regardless of which payer generated the document. Scanned or faxed EOBs typically see 95 percent or higher. Paper Alternative, a healthcare BPO, meets a 99.5 percent accuracy requirement on CMS 1500 forms across payer variations. Lido offers free 24-hour reprocessing so you can refine extraction instructions until accuracy meets your billing system’s threshold.
Lido supports HIPAA-compliant document processing workflows. When evaluating any document extraction vendor for insurance documents, check for data encryption in transit and at rest, access controls, audit logging, and willingness to sign a Business Associate Agreement (BAA). See our guide to OCR for healthcare for detailed compliance considerations.
Use conditional extraction logic in your template instructions. For example, you can sum all CO-18 (duplicate claim/already paid) line items into one column and flag all CO-29 (timely filing limit) line items for resubmission in another. You can also separate PR-1 (patient deductible) amounts into a patient responsibility column. One set of instructions handles denial code logic across all payers without requiring per-payer configurations.
No. You can upload a mixed batch of EOBs, authorizations, and claims from any combination of payers, and the system processes all formats in a single run. There is no pre-classification or sorting step required. The AI reads each document individually and extracts based on content, not based on payer identity or document format.
Yes. Paper EOBs are processed via AI extraction, which reads the document visually and maps fields to your output columns. Electronic 835 ERAs follow the ANSI standard format and are parsed accordingly. Both produce the same structured output, so your billing system receives consistent data regardless of whether the original was a scanned paper EOB or an electronic remittance file.
Nothing — no reconfiguration is needed. AI-first extraction reads documents contextually, understanding what each field represents based on its content and surrounding context rather than matching fixed coordinates or positions. When a payer moves a field, changes a label, or restructures their EOB layout, the extraction continues to work because the AI adapts to the new format automatically.
On clean digital PDFs, Lido achieves 99.5 to 100 percent field-level accuracy regardless of which payer generated the document. Scanned or faxed EOBs typically see 95 percent or higher. Paper Alternative, a healthcare BPO, meets a 99.5 percent accuracy requirement on CMS 1500 forms across payer variations. Lido offers free 24-hour reprocessing so you can refine extraction instructions until accuracy meets your billing system’s threshold.
Lido supports HIPAA-compliant document processing workflows. When evaluating any document extraction vendor for insurance documents, check for data encryption in transit and at rest, access controls, audit logging, and willingness to sign a Business Associate Agreement (BAA). See our guide to OCR for healthcare for detailed compliance considerations.
Use conditional extraction logic in your template instructions. For example, you can sum all CO-18 (duplicate claim/already paid) line items into one column and flag all CO-29 (timely filing limit) line items for resubmission in another. You can also separate PR-1 (patient deductible) amounts into a patient responsibility column. One set of instructions handles denial code logic across all payers without requiring per-payer configurations.