Documentation

Documentation

  • Conversation
  • Reader
  • Speech
  • Console
  • AI Agents
  • Languages iconJanpanese
    • Tiếng Việt
    • English

›概要

概要

  • Driving License Recognition
  • ID Recognition
  • Passport Recognition
  • Facematch
  • Face Search
  • Reader
  • SDK eKYC
  • Liveness Detection

API

  • Driving Licence Recognition
  • ID Recognition
  • Passport Recognition
  • Face Search
  • Facematch
  • Liveness Detection

チュートリアル

  • Driving License Recognition
  • ID Recognition
  • Passport Recognition
  • Face Search
  • Facematch
  • SDK eKYC
  • Liveness Detection

Overview

FPT.AI Reader provides users a powerful AI solution to digitalize all business documents in efficient and convenient way.

Business user can:

  • Apply well-built OCRs rapidly in the marketplace with high accuracy.

  • Build own new OCR for business specific documents with efficient building flow.

  • Use and manage OCR process, results.

  • Upgrade OCR's accuracy continuously during using period.

  • Integrate OCR results easily to other business systems.

  • Analyze OCR results in multiple dimensions.

User Guides

Types of usage

  • Basic: Quick use ready-to-use OCRs (which are already well-built with very-high accuracy)*

  • Advance: Build own new OCR

1

1. Basic: Ready-to-use OCRs (Marketplace)

1.1 Open Marketplace

  • On page "Dashboard":

    • Click on button "OCR Marketplace" to open Marketplace

1.2 Browse Marketplace

  • On page "Marketplace":

    • Browse list of ready-to-use OCRs which grouped by Categories:

      • Personal docs

      • Insurance docs

      • Finance docs

      • Banking docs

      • General docs

    • Click button "Use" on a doc to start using it.

2. Advance: Build new OCR

2.1 Create

  • On page "Dashboard":

    • Click button "Add new"

    • Fill Creating popup:

      • Name: name of new application (ex: document name)

      • Type: processing type of application:

        • CROP app: main process is Cropping exactly document from original image (excluding irrelevant background)

          • Crop app could be used as first step of various OCR apps

          • Ex: "Card Crop" app could be used for OCR apps: ID card, Driver License card, ATM card...

        • OCR app: main process is Extracting required information from the document.

          • (Optional) While create OCR app, user has an option to select suitable CROP app

            • This is not required if normally original images has not much irrelevant background (ex: scanned documents)

2.2 Train

  • Samples (sidebar menu "Samples"):

    • Add:

      • Click "Add New"

      • Select files

      • Click "Upload"

    • Tag:

      • Click one document on the uploaded list to expand its sections

        • Note:

          • Section "Original": is the original uploaded image

          • Section "Crop": is the cropped image (if the app has Crop function)

          • Section "OCR": is the extracted information (if the app has OCR function)

          • Section has:

            • Button "Edit": to edit Crop/OCR result

            • Button "Add to Train": to add the doc to Training dataset

            • Button "Review": to mark as reviewed

      • Click "Edit" to open page "Edit" and start tagging data

        • Note:

          • For Crop app: adjust Cropping area by updating its 4 corner points

          • For OCR app: use below tools to tag the reading selections and their actual texts

          • Button "Add selection": to add new selection which requires OCR reading

            • Use mouse to draw rectangle selection

            • Button a selection to add/update its:

              • 'label' - name of the selection

              • 'value' - actual text in the selection

              • button 'delete' - delete the selection

            • Button "Remove all selections": to remove all current selections

            • Button "Zoom" in/out: to zoom the image for easier tagging

            • Button "Rotate": to rotate the image for easier tagging

            • Button "Add to Train": to add the current sample to Training dataset

  • Training (sidebar menu "Training"):

    • Click "Train" to start training process

    • View Training history details in the following list

    • Click "Set Prod" to publish the trained OCR model to start Using

3. Managements

3.1 Results

  • Upload files need to be OCRs:

    • Via Web UI

      • Click "Upload"

      • Select files and upload

    • Via API

  • View OCR results on the following table

  • Export OCR results as file

  • Improve OCR (for new OCR only):

    • Click "Add to sample" on a document which not correct result

    • Go to page Samples

    • Update the added result sample with correct data (Crop/OCR)

    • Training and Using again

3.2 Permissions

User can share the OCR app with a team in different roles:

  • Role "Viewer": who can view only

  • Role "Editor": who can edit bot's Training data (for new-OCR app only)

3.3 Settings

3.3.1 Using Settings

  • Input:

    • Call API

      • Project

      • Webhook

      • Key

    • Usage

  • Output:

    • Export Excel

    • Export API

3.3.2 Training Settings

  • (Premium User) Configure parameters for tuning Training model.
← Face SearchSDK eKYC →
  • Types of usage
  • 1. Basic: Ready-to-use OCRs (Marketplace)
    • 1.1 Open Marketplace
    • 1.2 Browse Marketplace
  • 2. Advance: Build new OCR
    • 2.1 Create
    • 2.2 Train
  • 3. Managements
    • 3.1 Results
    • 3.2 Permissions
    • 3.3 Settings
Conversation
DocumentationAPI ReferenceTutorials (Video)
Reader
DocumentationAPI ReferenceTutorials
Speech
DocumentationAPI ReferenceTutorials
Copyright © 2025 FPT Corporation