Extract data using the AI parsing engine | Parseur Support Center (2024)

Parseur's advanced AI technology now empowers you to extract data from documents by simply leveraging the field names within your mailbox. No more manual template setup – just seamless and accurate data extraction, regardless of language or document complexity.

Parseur's AI extraction feature introduces a new era of efficiency and convenience:

  1. Template-less extraction: Bid farewell to template creation and updates. Our AI-driven solution eliminates the need for manual setup, allowing you to automatically extract data from documents.

  2. Field names are the key: Guiding our AI to extract the precise data you require is as simple as naming the desired fields within your mailbox. These field names serve as intuitive cues for our AI to identify and extract the corresponding data.

  3. Multilingual proficiency: Parseur's AI understands and extracts data from documents in any language, ensuring global accessibility and applicability.

The AI engine has a few limitations to bear in mind:

  • Page count limitation: the AI is capable of extracting data from about 5 to 10 pages of any document. The exact number of pages can be slightly more or less, depending on the text density of your pages. In any case, Parseur will not charge you more than 10 credits per document.

Credit usage considerations: Price remains 1 credit = 1 page. Like the template-based approach, reprocessing documents with AI is free. Due to the heavy computational resources required for AI-driven extraction, we might contact you if we notice an extremely high rate of reprocessing.

Getting started with Parseur's AI parsing feature is quick and intuitive.

Step 1 : Create a new mailbox

Choose from our pre-defined mailboxes or create a customized mailbox tailored to your needs.

If you have an existing mailbox for which you want to use AI, enable the AI engine as described in step 2 below.

Step 2: Enable AI at mailbox level (or user level)

After selecting the mailbox type, click the AI checkbox to activate it.

Extract data using the AI parsing engine | Parseur Support Center (1)

You can also activate AI on an existing mailbox in the mailbox settings:

Extract data using the AI parsing engine | Parseur Support Center (2)

Finally, you can also activate AI for all of your existing mailboxes in your user account. Click on your name in the left menu > Account > Manage account > AI engine.

Step 3: Upload a sample document

Upload a representative sample document that showcases the type of data you want to extract.

Extract data using the AI parsing engine | Parseur Support Center (3)

Step 4: Configure your fields (Optional)

Wait for Parseur to analyze the document.

Then, if your mailbox already has fields (for example, if you chose one of our pre-defined mailboxes), Parseur will immediately start the extraction process.

Create simple fields

For custom mailboxes where there are no default fields, you will need to create some fields:

  1. Click on the uploaded document to view it

  2. Navigate to the Fields tab.

  3. Add the specific fields you wish to extract.

  4. Ensure these fields are named in a manner that the AI can easily understand, such as using terms like "InvoiceNumber" or "customer_address".

Extract data using the AI parsing engine | Parseur Support Center (4)

Create table fields to capture repeating data

To extract a list of repeating data, use the New Table button.

Then click on the "Add fields to <your field>" button to name the individual fields you want to extract from the table.

Extract data using the AI parsing engine | Parseur Support Center (5)

Repeat this for each field you want to add. For example: quantity, description, sku, price, etc.

Step 5: Process your document and check the results

After adding all desired extraction fields, click the "Process" button to initiate the AI-driven data extraction process.

Extract data using the AI parsing engine | Parseur Support Center (6)

Parseur AI didn't fetch the value I wanted for some of my fields. How can I train the Parseur AI model to do better?

Tip #1: use better field names

Parseur uses the name of your fields to find the relevant data in your documents. If the wrong value is fetched, try renaming the field to something more accurate that AI will better understand. Think of the AI as a data entry trainee that needs guidance to understand what you want.

For example, to capture the invoice number in invoice documents:

  • ❌ don't name the field Invoice or Number or invno

  • ✅ name it InvoiceNumber, invoice_number or Invoice number

Tip #2: delete unused or duplicate fields

The more fields you have, the more the AI tends to get some of them wrong. If tip 1 didn't help, try to restrict the number of extracted fields to the core of what you need.

Tip #3: consider using the template engine for some layouts

AI being a probabilistic model, it cannot guarantee 100% accuracy for all documents. If you need better results and don't manage to get them, you could consider creating some templates for some of the layout. Read more about the pros and cons our AI parsing engine vs template parsing engines.

Parseur only retrieved 1 data point from my documents. I have other similar data points in my document. How do I tell Parseur to extract all the data?

If the data repeats within a page, use Table fields instead of single fields:

  • Go to the the Fields tab when viewing a document

  • Click New Table

  • Name it something the AI will understand (for example, if you are working on extracting contact details, name the table something like ContactList)

  • Click Create

  • Click Add Field and name each field similar to the single fields you had previously

  • Delete the single fields so as not to confuse the AI

  • Reprocess your documents and check to see if you get the right results

If your document contains several individual documents (like several invoices, for example), use the Split PDF feature described below.

I have long documents; will AI be able to extract data from them?

AI will only be able to extract data from the first few pages of your document. The exact number depends on document density and the number of pages.

If you have long documents, you can consider the following options:

  • If you have a PDF consisting of several individual documents all bundled together, you can use the Split document feature to have Parseur cut the document into individual ones.

  • You can also consider using one of our two template engines: Text engine for emails and text documents and OCR engine for PDFs

I have some templates and the AI engine enabled in my mailbox. Which engine will be used to parse my documents?

Matching templates take priority over the AI engine. But if there are no matching templates, Parseur will use the AI Engine to extract your data.

How secure is my data when using the AI engine? Do you share my data to improve the AI model?

Parseur uses Azure AI to parse your data. Your data is processed in the European Union. Your data is not used to improve the AI model.

Related Articles

Create your first template to extract text from emailsDocument formats supported by ParseurCreate your first OCR template to extract text from PDFCustomize parsed data structureAI vs template parsing: pros and cons
Extract data using the AI parsing engine | Parseur Support Center (2024)
Top Articles
Something Stoic in Plato’s Sophist | Oxford Studies in Ancient Philosophy
How to Write a Memo in 8 Steps
Pet For Sale Craigslist
Tv Guide Bay Area No Cable
Arrests reported by Yuba County Sheriff
10000 Divided By 5
Joe Gorga Zodiac Sign
Beau John Maloney Houston Tx
Jc Post News
Shreveport Active 911
Current Time In Maryland
Bnsf.com/Workforce Hub
Spergo Net Worth 2022
Gentle Dental Northpointe
Www.publicsurplus.com Motor Pool
Closest Bj Near Me
Dragger Games For The Brain
Glover Park Community Garden
Mybiglots Net Associates
Inbanithi Age
Craigslist Apartments In Philly
Sorrento Gourmet Pizza Goshen Photos
City Of Durham Recycling Schedule
Die 8 Rollen einer Führungskraft
Truvy Back Office Login
Abga Gestation Calculator
Healthy Kaiserpermanente Org Sign On
Publix Christmas Dinner 2022
Imagetrend Elite Delaware
Happy Shuttle Cancun Review
Downloahub
Hannah Jewell
Mia Malkova Bio, Net Worth, Age & More - Magzica
Gridwords Factoring 1 Answers Pdf
47 Orchid Varieties: Different Types of Orchids (With Pictures)
Verizon TV and Internet Packages
Shiftwizard Login Johnston
24 slang words teens and Gen Zers are using in 2020, and what they really mean
2012 Street Glide Blue Book Value
Covalen hiring Ai Annotator - Dutch , Finnish, Japanese , Polish , Swedish in Dublin, County Dublin, Ireland | LinkedIn
Synchrony Manage Account
Magicseaweed Capitola
Kornerstone Funeral Tulia
craigslist: modesto jobs, apartments, for sale, services, community, and events
Wilson Tattoo Shops
Sara Carter Fox News Photos
Europa Universalis 4: Army Composition Guide
Zipformsonline Plus Login
bot .com Project by super soph
Das schönste Comeback des Jahres: Warum die Vengaboys nie wieder gehen dürfen
Wwba Baseball
Divisadero Florist
Latest Posts
Article information

Author: Amb. Frankie Simonis

Last Updated:

Views: 5977

Rating: 4.6 / 5 (56 voted)

Reviews: 87% of readers found this page helpful

Author information

Name: Amb. Frankie Simonis

Birthday: 1998-02-19

Address: 64841 Delmar Isle, North Wiley, OR 74073

Phone: +17844167847676

Job: Forward IT Agent

Hobby: LARPing, Kitesurfing, Sewing, Digital arts, Sand art, Gardening, Dance

Introduction: My name is Amb. Frankie Simonis, I am a hilarious, enchanting, energetic, cooperative, innocent, cute, joyous person who loves writing and wants to share my knowledge and understanding with you.