Visual Experience Platform
Help CenterRelease NotesBlogWebsite
  • VXP - Visual Experience Platform
    • The VXP
      • Token
    • Settings
      • My Account
        • Profile
        • Support
      • Organisation
        • General Organisation Settings
        • Billing
          • Payment
          • Invoices
          • Plan
        • Users
          • Users
          • Roles
        • Teams
      • Project
        • Branding
        • Subscription
        • Analytics
          • Standard Dashboards
            • Multi tenants
            • User Dashboard
            • DAM Storage
            • Workflows Monitoring
            • Traffic Origin
            • Caching (Volumetry)
            • CDN performance
          • Custom Dashboard
        • Access
          • API keys
          • Security templates
          • OAuth2
  • Digital Asset Management (DAM)
    • Library
      • Assets
        • Asset Details window
          • Asset actions
            • Regional settings
            • Context menu ("..." button)
            • Add to My favorites (heart icon)
            • Edit media (image, video, etc)
              • Edit image
              • Edit video
              • Edit design template (coming soon)
            • Share
          • Asset information tabs
            • General
            • Metadata
            • Variations
            • Comments
            • Approvals
            • History
      • Folders
      • Collections
      • Labels
      • Products
      • My favorites
      • Help
      • Sharebox
      • Airbox
      • Search and Filters
        • Faceted Search
    • Plugins & Connectors
      • Plugins
        • Adobe Creative Cloud
        • Adobe Commerce (Magento)
        • Canva
        • Contentful
        • Contentstack
        • Directus
        • Drupal
        • Opencart
          • Opencart (v4)
        • Prestashop
        • Shopware
        • Storyblok
        • Strapi (v4)
        • Sylius
        • Uniform CMS
        • WordPress - VXP [Beta]
        • Wordpress
      • Connectors
        • Akeneo PIM
          • Akeneo Community PIM Connector
          • Akeneo Enterprise PIM App
        • Canva App
        • CI-Hub
        • Commercetools FaaS App
        • Hygraph app
        • Kontent.ai
        • OneTeg
        • Pabbly Connect
        • Prismic
        • Shopify / Shopify plus app
        • Zapier automation
    • Settings
      • Library
        • Components
        • Appearance
      • Metadata
        • Metadata Configuration
        • Assets
      • Tags
        • Configuration
        • Dictionary
      • Notifications
      • Automations
        • Post processing
        • Webhooks
        • Workflows
      • Storage
        • Providers
        • Upload
        • Video
        • Listing
        • Retrieval
        • Custom routing
  • Visual AI
    • Welcome
    • Visual AI
      • Images
        • Classification models
          • Auto-tagging
          • Brand detect
          • Dominant color extraction
          • Faces
            • Face analysis
            • Face clustering
          • Image quality
          • OCR
          • Number Plate recognition
          • Product type
          • Property classification
          • Scene Classification
          • Sport Classification
        • Generative AI models
          • Image-to-text
          • Plate blurring
          • Quality improvement (remove artifacts)
          • Remove background
          • Text-to-Image
        • Moderation models
          • Face count
          • NSFW - Not Safe For Work
          • Real estate authenticity
          • Watermark detection
      • Videos
        • Face detection
      • Search & find assets
        • Text Search
        • Similar Assets
  • Portals
    • Welcome
    • Creating a Portal
    • Editing a Portal
      • Pages
      • Sections
      • Design
      • Fonts
    • Managing a Portal
      • Settings
      • Access
      • Users
      • Labeling, Cloning, Archiving
    • Publishing a Portal
  • Dynamic Media Optimization (DMO)
    • Welcome
      • Responsive libraries
      • Native plugins
      • 360° view builder
      • Service status
    • Insights
      • Delivery
      • Optimization
      • Alerts
      • Logs
    • Transformations
      • Image optimization
        • Operations
          • Width and height
          • Prevent enlargement
          • Crop
            • Automatic Gravity Crop
            • Positionable Crop
            • Focal point Crop
            • Face Crop
            • Face hide
            • Aspect ratio crop
          • Fit
          • Cropfit
          • Bound
          • Boundmin
          • Cover
          • Device pixel ratio
          • Flip
          • Rotate
          • Trim
          • Rounded corners
        • Filters
          • Adjustment
            • Brightness
            • Contrast
            • Saturate
          • Color manipulation
            • Color overlay
            • Grayscale
            • Duotone
            • Sepia
            • Invert
          • Blur
          • Pixelate
          • Sharpen
        • Watermarking
          • Static watermark
          • Dynamic watermark
          • Text watermark
        • Image compression
          • Image formats
          • Optipress
      • Video optimization
        • Video API
          • Editing
            • Chapters
            • Combine
            • Trim
          • Optimizing
            • Convert
            • Compress
            • Transcode
        • On-the-fly-video optimization
      • Static content optimization
        • PDF to image
        • JS/CSS optimization
    • Invalidation
    • Settings
      • Asset Origin
        • AWS S3 or any other S3-compatible storage provider
        • Google Cloud storage (GCP)
        • Azure Blob storage
        • Own HTTP-based Storage
      • Images
        • Compression
        • Options
        • Watermark
      • Static content
      • Delivery
        • CNAME
        • URL format
        • Rules
        • Security
        • Caching
        • Default behaviors
  • Developers / Headless
    • Headless DAM
      • DAM APIs
        • API Authentication
        • API Reference
      • Command Line Interface (CLI)
      • Media Asset Widget(MAW)
        • Overview
        • Developer reference
        • V2 End-of-life
Powered by GitBook
LogoLogo

Quick links

  • Go to website
  • Legal Center

©2024 Scaleflex SAS

On this page
  • Overview
  • Use cases
  • API endpoints
  • Example API responses
Export as PDF
  1. Visual AI
  2. Visual AI
  3. Images
  4. Classification models

OCR

A machine-learning algorithm that utilizes optical character recognition techniques to accurately identify and extract text from images or scanned documents

PreviousImage qualityNextNumber Plate recognition

Last updated 7 months ago

Overview

The OCR ML model is an integral component of our service, providing robust optical character recognition capabilities with support for multiple languages.

The model is designed to convert text present in images or scanned documents into editable and searchable data. It leverages the power of machine learning techniques to accurately recognize characters and words, enabling efficient text extraction and analysis.

It examines the input, tries to detect any text fragments present in the image and recognizes the characters in those fragments according to the specified language. Detected fragments with a good enough confidence level are returned as text strings.

Use cases

The OCR service can be useful for multiple use cases, including:

  • Text extraction and indexing - The model extracts text from images or scanned documents, enabling efficient indexing and searching of digital assets. Users can find images or documents based on specific keywords or phrases mentioned within the text content.

  • Document digitization - Important information can be preserved by converting physical documents into digital formats.

  • Language translation - Combined with language translation capabilities, OCR can facilitate multilingual asset management, enabling users to search and translate text in various languages.

  • Improved accessibility - Converting text within images or scanned documents into machine-readable format enhances accessibility for visually impaired individuals, enabling screen readers or assistive technologies to interpret the content.

API endpoints

An up-to-date reference with all API endpoints is available here:

Example API responses

Input image
API response

{
    "status": "success",
    "version": "2.9.3",
    "request_uuid": "09e7f3ba-f8fc-4ce1-b263-a717989cb4fa",
    "sha1": "9aeafd9e6b9aeb065a24a996f1a0b8cd57b97a6c",
    "language": "en",
    "result": "Las Vegas 72 Salt Lake City 493"
}
{
  "status": "success",
  "version": "2.9.3",
  "request_uuid": "3ca21912-dc2a-4008-9894-e7a5ea116286",
  "sha1": "3d50832674417549ae53ec0a27c3bfea3ebd946b",
  "language": "pl",
  "result": "Przepraszam, czy dostanę te buty w rozmiarze 39? Tak, chciałaby Pani przymierzyć? Tak. Proszę. Dziękuję  Są chyba trochę za małe. Czy ma pan czterdziestkę? To jest czterdziestka. Te są dobre. Wezmę je. Ile kosztują?"
}
Scaleflex API for Digital Asset Management (DAM), Visual AI and Media OptimizationScaleflex API
Logo