2 of 14

Classification models

Machine learning models that analyze the content of an image and assign it to predefined categories or labels

Classification models in DAM

Machine learning models, particularly classification models, play a crucial role in digital asset management. Classification models are algorithms that learn patterns and relationships in data to assign predefined categories or labels to new, unseen data. They can be used to categorize and organize assets, such as images, videos, or documents, based on their content or characteristics.

Classification models are typically trained on labeled datasets, where each data instance is associated with a known category or label. During the training process, the model learns to recognize patterns and features in the input data that differentiate one category from another. Once trained, the model can classify new, unseen data based on the learned patterns.

Auto-tagging

Automatic assignment of relevant tags and keywords to the input image

Overview

Harnessing the power of visual assets is crucial for businesses across various industries. However, as digital libraries grow exponentially, manually assigning tags and keywords to images becomes a daunting and time-consuming task. That's where automatic tagging can help.

The Image tagging ML model is a powerful machine learning algorithm integrated into our service. It automatically analyzes the pixel content of images, extracts their features, and assigns relevant tags or keywords to facilitate improved organization and utilization of visual assets. With this model, businesses can unlock a new level of efficiency and productivity.

The process of AI tagging involves object detection techniques that detect objects, scenes, and concepts within the images. Based on this analysis, appropriate tags are assigned to each image, enabling proper asset management and content discovery.

Use cases

The AI tagging model can provide support for a wide range of typical use cases, including:

Efficient organization - Automatically assigned tags enable efficient organization of visual assets by categorizing them based on specific objects, scenes or concepts. Users can easily navigate through their image library and quickly locate the desired assets.
Accelerated search - Users can retrieve images based on generated tags or keywords. This accelerates the search process, making it effortless to find the required images in large collections.
Content Recommendation - Visually similar assets can be identified based on shared tags. Users can discover related content and leverage the full potential of their visual assets.
Visual content analysis - The model can aid in understanding visual trends, analyzing customer preferences and performing visual data analytics.

API endpoints

An up-to-date reference with all API endpoints is available here:

Scaleflex API for Digital Asset Management (DAM), Visual AI and Media OptimizationScaleflex API

Example API responses

Input image

API response

{
    "status": "success",
    "version": "2.9.3",
    "request_uuid": "9ee38368-afb9-4d53-9a91-0581ebf1723a",
    "sha1": "d7bd5990a75d996fb27ff25a7649f5b0748df0e3",
    "count": 10,
    "tags": [
        {
            "confidence": 93.844,
            "tag": {
                "en": "Bird"
            }
        },
        {
            "confidence": 90.226,
            "tag": {
                "en": "Beak"
            }
        },
        {
            "confidence": 85.26,
            "tag": {
                "en": "Gesture"
            }
        },
        {
            "confidence": 83.657,
            "tag": {
                "en": "Grass"
            }
        },
        {
            "confidence": 80.749,
            "tag": {
                "en": "Plant"
            }
        },
        {
            "confidence": 79.05,
            "tag": {
                "en": "Happy"
            }
        },
        {
            "confidence": 78.175,
            "tag": {
                "en": "Wing"
            }
        },
        {
            "confidence": 75.138,
            "tag": {
                "en": "Feather"
            }
        },
        {
            "confidence": 72.807,
            "tag": {
                "en": "Seabird"
            }
        },
        {
            "confidence": 72.791,
            "tag": {
                "en": "Water"
            }
        }
    ],
    "file_downloaded": "4345ba03-f633-5c59-9f6e-a8b602950000.jpg"
}

{
    "status": "success",
    "version": "2.9.3",
    "request_uuid": "7db96f16-8aaa-4274-a74d-7b927ab5d1d9",
    "sha1": "a1ed145a10339e4ba9a7ed8d56d058b877a9a98c",
    "count": 10,
    "tags": [
        {
            "confidence": 97.906,
            "tag": {
                "en": "Wheel"
            }
        },
        {
            "confidence": 97.475,
            "tag": {
                "en": "Tire"
            }
        },
        {
            "confidence": 94.934,
            "tag": {
                "en": "Vehicle"
            }
        },
        {
            "confidence": 92.089,
            "tag": {
                "en": "Car"
            }
        },
        {
            "confidence": 91.314,
            "tag": {
                "en": "Automotive tire"
            }
        },
        {
            "confidence": 90.513,
            "tag": {
                "en": "Automotive lighting"
            }
        },
        {
            "confidence": 89.459,
            "tag": {
                "en": "Automotive design"
            }
        },
        {
            "confidence": 88.105,
            "tag": {
                "en": "Motor vehicle"
            }
        },
        {
            "confidence": 87.549,
            "tag": {
                "en": "Alloy wheel"
            }
        },
        {
            "confidence": 84.563,
            "tag": {
                "en": "Bumper"
            }
        }
    ],
    "file_downloaded": "car.webp"
}

Brand detect

An ML model that detects logos of popular brands, visible in the input image

Overview

In today's digital landscape, brands are a vital aspect of any business. A brand logo serves as a symbol of identity, recognition, and trust. Managing and protecting brand assets is essential for maintaining brand consistency, ensuring compliance, and maximizing brand visibility. Our model is designed to streamline this process by automating the detection and categorization of brand logos within your digital assets, facilitating effective brand asset management.

The Brand recognizer is powered by state-of-the-art machine learning algorithms, and trained on vast datasets containing diverse brands from a wide selection of industries. This enables the model to accurately recognize and detect brand logos with varying sizes, orientations, and backgrounds within the provided images.

The model supports a vast number of popular brands and enables efficient brand analytics and monitoring, providing valuable insights and efficient brand management within our service.

Use cases

The following are some use cases, suitable for our Brand recognition model:

Brand monitoring and compliance - By identifying brand logos in user-generated content, the model assists in brand monitoring, ensuring compliance with brand guidelines and protecting brand integrity.
Brand analytics - The model provides insights into brand visibility and presence by analyzing the frequency and distribution of detected brand logos across digital assets. This information can be used for brand performance evaluation and marketing strategies.
Competition Analysis - By recognizing competitor brand logos within the digital assets, the model aids in conducting competitive analysis, identifying market trends, and evaluating brand positioning.
Image Retrieval - The Brand Recognizer enables efficient searching and retrieval of digital assets by logo-based queries. Users can locate specific assets based on recognized brands.
Digital rights management - Copyright and licensing agreements can be enforced with automatic logo detection and tracking. This ensures compliance with intellectual property regulations.

API endpoints

An up-to-date reference with all API endpoints is available here:

Scaleflex API for Digital Asset Management (DAM), Visual AI and Media OptimizationScaleflex API

Example API responses

Input image

Output preview

API response

{
    "status": "success",
    "version": "2.9.3",
    "request_uuid": "259d598f-a01d-4828-988f-7745302043aa",
    "sha1": "db2b587ac357cf459a900f429098cac7fcfafaf1",
    "brand": {
        "extracted": [
            {
                "dimensions": {
                    "top": 799,
                    "bottom": 954,
                    "left": 174,
                    "right": 337
                },
                "vertices": [
                    {
                        "x": 174,
                        "y": 799
                    },
                    {
                        "x": 337,
                        "y": 799
                    },
                    {
                        "x": 337,
                        "y": 954
                    },
                    {
                        "x": 174,
                        "y": 954
                    }
                ],
                "classes": [
                    {
                        "class": "ineos",
                        "label": "Ineos"
                    }
                ],
                "meta": {
                    "id": "6b653e36-159d-11ee-92e7-b5eae7b56dc2",
                    "type": "logo",
                    "clarity": 0.8488425301290431,
                    "ocr_confirmed": false
                }
            },
            {
                "dimensions": {
                    "top": 483,
                    "bottom": 686,
                    "left": 735,
                    "right": 1005
                },
                "vertices": [
                    {
                        "x": 735,
                        "y": 483
                    },
                    {
                        "x": 1005,
                        "y": 483
                    },
                    {
                        "x": 1005,
                        "y": 686
                    },
                    {
                        "x": 735,
                        "y": 686
                    }
                ],
                "classes": [
                    {
                        "class": "ineos",
                        "label": "Ineos"
                    }
                ],
                "meta": {
                    "id": "6b762e20-159d-11ee-92e7-b5eae7b56dc2",
                    "type": "logo",
                    "clarity": 0.9131165825918466
                }
            },
            {
                "dimensions": {
                    "top": 585,
                    "bottom": 629,
                    "left": 572,
                    "right": 693
                },
                "vertices": [
                    {
                        "x": 572,
                        "y": 585
                    },
                    {
                        "x": 693,
                        "y": 585
                    },
                    {
                        "x": 693,
                        "y": 629
                    },
                    {
                        "x": 572,
                        "y": 629
                    }
                ],
                "classes": [
                    {
                        "class": "ineos",
                        "label": "Ineos"
                    }
                ],
                "meta": {
                    "id": "6b762e21-159d-11ee-92e7-b5eae7b56dc2",
                    "type": "logo",
                    "clarity": 0.8134818898436957
                }
            },
            {
                "dimensions": {
                    "top": 590,
                    "bottom": 635,
                    "left": 333,
                    "right": 444
                },
                "vertices": [
                    {
                        "x": 333,
                        "y": 590
                    },
                    {
                        "x": 444,
                        "y": 590
                    },
                    {
                        "x": 444,
                        "y": 635
                    },
                    {
                        "x": 333,
                        "y": 635
                    }
                ],
                "classes": [
                    {
                        "class": "ubs",
                        "label": "UBS"
                    }
                ],
                "meta": {
                    "id": "6b653e37-159d-11ee-92e7-b5eae7b56dc2",
                    "type": "logo",
                    "clarity": 0.7896246827042861,
                    "ocr_confirmed": false
                }
            },
            {
                "dimensions": {
                    "top": 473,
                    "bottom": 541,
                    "left": 355,
                    "right": 421
                },
                "vertices": [
                    {
                        "x": 355,
                        "y": 473
                    },
                    {
                        "x": 421,
                        "y": 473
                    },
                    {
                        "x": 421,
                        "y": 541
                    },
                    {
                        "x": 355,
                        "y": 541
                    }
                ],
                "classes": [
                    {
                        "class": "mercedes-benz",
                        "label": "Mercedes-Benz"
                    }
                ],
                "meta": {
                    "id": "6b653e38-159d-11ee-92e7-b5eae7b56dc2",
                    "type": "logo",
                    "clarity": 0.758294096513573,
                    "ocr_confirmed": false
                }
            },
            {
                "dimensions": {
                    "top": 432,
                    "bottom": 479,
                    "left": 478,
                    "right": 525
                },
                "vertices": [
                    {
                        "x": 478,
                        "y": 432
                    },
                    {
                        "x": 525,
                        "y": 432
                    },
                    {
                        "x": 525,
                        "y": 479
                    },
                    {
                        "x": 478,
                        "y": 479
                    }
                ],
                "classes": [
                    {
                        "class": "monster_energy",
                        "label": "Monster Energy"
                    }
                ],
                "meta": {
                    "id": "6b653e39-159d-11ee-92e7-b5eae7b56dc2",
                    "type": "logo",
                    "clarity": 0.6406417129487874,
                    "ocr_confirmed": false
                }
            },
            {
                "dimensions": {
                    "top": 652,
                    "bottom": 704,
                    "left": 566,
                    "right": 704
                },
                "vertices": [
                    {
                        "x": 566,
                        "y": 652
                    },
                    {
                        "x": 704,
                        "y": 652
                    },
                    {
                        "x": 704,
                        "y": 704
                    },
                    {
                        "x": 566,
                        "y": 704
                    }
                ],
                "classes": [
                    {
                        "class": "hewlett_packard_enterprise",
                        "label": "Hewlett Packard Enterprise"
                    }
                ],
                "meta": {
                    "id": "6b653e3a-159d-11ee-92e7-b5eae7b56dc2",
                    "type": "logo",
                    "clarity": 0.6718056289900787,
                    "ocr_confirmed": false
                }
            },
            {
                "dimensions": {
                    "top": 466,
                    "bottom": 565,
                    "left": 582,
                    "right": 677
                },
                "vertices": [
                    {
                        "x": 582,
                        "y": 466
                    },
                    {
                        "x": 677,
                        "y": 466
                    },
                    {
                        "x": 677,
                        "y": 565
                    },
                    {
                        "x": 582,
                        "y": 565
                    }
                ],
                "classes": [
                    {
                        "class": "petronas",
                        "label": "Petronas"
                    }
                ],
                "meta": {
                    "id": "6b653e3b-159d-11ee-92e7-b5eae7b56dc2",
                    "type": "logo",
                    "clarity": 0.7984981628947344,
                    "ocr_confirmed": false
                }
            },
            {
                "dimensions": {
                    "top": 726,
                    "bottom": 795,
                    "left": 337,
                    "right": 715
                },
                "vertices": [
                    {
                        "x": 337,
                        "y": 726
                    },
                    {
                        "x": 715,
                        "y": 726
                    },
                    {
                        "x": 715,
                        "y": 795
                    },
                    {
                        "x": 337,
                        "y": 795
                    }
                ],
                "classes": [
                    {
                        "class": "petronas",
                        "label": "Petronas"
                    }
                ],
                "meta": {
                    "id": "6b653e3c-159d-11ee-92e7-b5eae7b56dc2",
                    "type": "logo",
                    "clarity": 0.9134711968459155,
                    "ocr_confirmed": true
                }
            },
            {
                "dimensions": {
                    "top": 548,
                    "bottom": 573,
                    "left": 330,
                    "right": 447
                },
                "vertices": [
                    {
                        "x": 330,
                        "y": 548
                    },
                    {
                        "x": 447,
                        "y": 548
                    },
                    {
                        "x": 447,
                        "y": 573
                    },
                    {
                        "x": 330,
                        "y": 573
                    }
                ],
                "classes": [
                    {
                        "class": "amg",
                        "label": "AMG"
                    }
                ],
                "meta": {
                    "id": "6b653e3d-159d-11ee-92e7-b5eae7b56dc2",
                    "type": "logo",
                    "clarity": 0.6982388736604558,
                    "ocr_confirmed": false
                }
            },
            {
                "dimensions": {
                    "top": 655,
                    "bottom": 682,
                    "left": 330,
                    "right": 457
                },
                "vertices": [
                    {
                        "x": 330,
                        "y": 655
                    },
                    {
                        "x": 457,
                        "y": 655
                    },
                    {
                        "x": 457,
                        "y": 682
                    },
                    {
                        "x": 330,
                        "y": 682
                    }
                ],
                "classes": [
                    {
                        "class": "crowdstrike",
                        "label": "CrowdStrike"
                    }
                ],
                "meta": {
                    "id": "6b653e3e-159d-11ee-92e7-b5eae7b56dc2",
                    "type": "logo",
                    "clarity": 0.6548185074579731,
                    "ocr_confirmed": false
                }
            }
        ],
        "has_logo": true
    },
    "file_downloaded_in": 0.09
}

{
    "status": "success",
    "version": "2.9.3",
    "request_uuid": "5700077b-5b64-4b25-b386-06f4e6f99dec",
    "sha1": "e2f203d123d1f863efbe4efe69efd92654979275",
    "brand": {
        "extracted": [
            {
                "dimensions": {
                    "top": 237,
                    "bottom": 293,
                    "left": 746,
                    "right": 883
                },
                "vertices": [
                    {
                        "x": 746,
                        "y": 237
                    },
                    {
                        "x": 883,
                        "y": 237
                    },
                    {
                        "x": 883,
                        "y": 293
                    },
                    {
                        "x": 746,
                        "y": 293
                    }
                ],
                "classes": [
                    {
                        "class": "bybit",
                        "label": "Bybit"
                    }
                ],
                "meta": {
                    "id": "2b725a20-159c-11ee-9b4b-37b8298fd492",
                    "type": "logo",
                    "clarity": 0.8146587118747994,
                    "ocr_confirmed": false
                }
            },
            {
                "dimensions": {
                    "top": 285,
                    "bottom": 372,
                    "left": 180,
                    "right": 560
                },
                "vertices": [
                    {
                        "x": 180,
                        "y": 285
                    },
                    {
                        "x": 560,
                        "y": 285
                    },
                    {
                        "x": 560,
                        "y": 372
                    },
                    {
                        "x": 180,
                        "y": 372
                    }
                ],
                "classes": [
                    {
                        "class": "oracle",
                        "label": "Oracle"
                    }
                ],
                "meta": {
                    "id": "2b725a21-159c-11ee-9b4b-37b8298fd492",
                    "type": "logo",
                    "clarity": 0.8986810549427108,
                    "ocr_confirmed": false
                }
            },
            {
                "dimensions": {
                    "top": 160,
                    "bottom": 186,
                    "left": 585,
                    "right": 646
                },
                "vertices": [
                    {
                        "x": 585,
                        "y": 160
                    },
                    {
                        "x": 646,
                        "y": 160
                    },
                    {
                        "x": 646,
                        "y": 186
                    },
                    {
                        "x": 585,
                        "y": 186
                    }
                ],
                "classes": [
                    {
                        "class": "tezos",
                        "label": "Tezos"
                    }
                ],
                "meta": {
                    "id": "2b725a22-159c-11ee-9b4b-37b8298fd492",
                    "type": "logo",
                    "clarity": 0.4980267747106751,
                    "ocr_confirmed": false
                }
            },
            {
                "dimensions": {
                    "top": 321,
                    "bottom": 369,
                    "left": 690,
                    "right": 738
                },
                "vertices": [
                    {
                        "x": 690,
                        "y": 321
                    },
                    {
                        "x": 738,
                        "y": 321
                    },
                    {
                        "x": 738,
                        "y": 369
                    },
                    {
                        "x": 690,
                        "y": 369
                    }
                ],
                "classes": [
                    {
                        "class": "tezos",
                        "label": "Tezos"
                    }
                ],
                "meta": {
                    "id": "2b725a23-159c-11ee-9b4b-37b8298fd492",
                    "type": "logo",
                    "clarity": 0.597878191955334,
                    "ocr_confirmed": false
                }
            },
            {
                "dimensions": {
                    "top": 427,
                    "bottom": 452,
                    "left": 626,
                    "right": 655
                },
                "vertices": [
                    {
                        "x": 626,
                        "y": 427
                    },
                    {
                        "x": 655,
                        "y": 427
                    },
                    {
                        "x": 655,
                        "y": 452
                    },
                    {
                        "x": 626,
                        "y": 452
                    }
                ],
                "classes": [
                    {
                        "class": "puma",
                        "label": "PUMA"
                    }
                ],
                "meta": {
                    "id": "2b725a24-159c-11ee-9b4b-37b8298fd492",
                    "type": "logo",
                    "clarity": 0.5123256180944338,
                    "ocr_confirmed": false
                }
            },
            {
                "dimensions": {
                    "top": 401,
                    "bottom": 414,
                    "left": 124,
                    "right": 180
                },
                "vertices": [
                    {
                        "x": 124,
                        "y": 401
                    },
                    {
                        "x": 180,
                        "y": 401
                    },
                    {
                        "x": 180,
                        "y": 414
                    },
                    {
                        "x": 124,
                        "y": 414
                    }
                ],
                "classes": [
                    {
                        "class": "siemens",
                        "label": "Siemens"
                    }
                ],
                "meta": {
                    "id": "2b725a25-159c-11ee-9b4b-37b8298fd492",
                    "type": "logo",
                    "clarity": 0.6122472914369587,
                    "ocr_confirmed": false
                }
            },
            {
                "dimensions": {
                    "top": 397,
                    "bottom": 448,
                    "left": 691,
                    "right": 812
                },
                "vertices": [
                    {
                        "x": 691,
                        "y": 397
                    },
                    {
                        "x": 812,
                        "y": 397
                    },
                    {
                        "x": 812,
                        "y": 448
                    },
                    {
                        "x": 691,
                        "y": 448
                    }
                ],
                "classes": [
                    {
                        "class": "rauch",
                        "label": "Rauch"
                    }
                ],
                "meta": {
                    "id": "2b725a26-159c-11ee-9b4b-37b8298fd492",
                    "type": "logo",
                    "clarity": 0.7545594366073807,
                    "ocr_confirmed": true
                }
            },
            {
                "dimensions": {
                    "top": 230,
                    "bottom": 264,
                    "left": 387,
                    "right": 463
                },
                "vertices": [
                    {
                        "x": 387,
                        "y": 230
                    },
                    {
                        "x": 463,
                        "y": 230
                    },
                    {
                        "x": 463,
                        "y": 264
                    },
                    {
                        "x": 387,
                        "y": 264
                    }
                ],
                "classes": [
                    {
                        "class": "citrix_systems",
                        "label": "Citrix Systems"
                    }
                ],
                "meta": {
                    "id": "2b725a27-159c-11ee-9b4b-37b8298fd492",
                    "type": "logo",
                    "clarity": 0.7476709675969357,
                    "ocr_confirmed": true
                }
            },
            {
                "dimensions": {
                    "top": 218,
                    "bottom": 231,
                    "left": 846,
                    "right": 919
                },
                "vertices": [
                    {
                        "x": 846,
                        "y": 218
                    },
                    {
                        "x": 919,
                        "y": 218
                    },
                    {
                        "x": 919,
                        "y": 231
                    },
                    {
                        "x": 846,
                        "y": 231
                    }
                ],
                "classes": [
                    {
                        "class": "tag_heuer",
                        "label": "Tag Heuer"
                    }
                ],
                "meta": {
                    "id": "2b725a28-159c-11ee-9b4b-37b8298fd492",
                    "type": "logo",
                    "clarity": 0.1704610042789269,
                    "ocr_confirmed": false
                }
            },
            {
                "dimensions": {
                    "top": 120,
                    "bottom": 355,
                    "left": 0,
                    "right": 390
                },
                "vertices": [
                    {
                        "x": 0,
                        "y": 120
                    },
                    {
                        "x": 390,
                        "y": 120
                    },
                    {
                        "x": 390,
                        "y": 355
                    },
                    {
                        "x": 0,
                        "y": 355
                    }
                ],
                "classes": [
                    {
                        "class": "red_bull",
                        "label": "Red Bull"
                    }
                ],
                "meta": {
                    "id": "2b725a29-159c-11ee-9b4b-37b8298fd492",
                    "type": "logo",
                    "clarity": 0.9012839524460063,
                    "ocr_confirmed": true
                }
            },
            {
                "dimensions": {
                    "top": 405,
                    "bottom": 426,
                    "left": 71,
                    "right": 113
                },
                "vertices": [
                    {
                        "x": 71,
                        "y": 405
                    },
                    {
                        "x": 113,
                        "y": 405
                    },
                    {
                        "x": 113,
                        "y": 426
                    },
                    {
                        "x": 71,
                        "y": 426
                    }
                ],
                "classes": [
                    {
                        "class": "armor_all",
                        "label": "Armor All"
                    }
                ],
                "meta": {
                    "id": "2b725a2a-159c-11ee-9b4b-37b8298fd492",
                    "type": "logo",
                    "clarity": 0.4053942317393592,
                    "ocr_confirmed": true
                }
            },
            {
                "dimensions": {
                    "top": 420,
                    "bottom": 442,
                    "left": 149,
                    "right": 185
                },
                "vertices": [
                    {
                        "x": 149,
                        "y": 420
                    },
                    {
                        "x": 185,
                        "y": 420
                    },
                    {
                        "x": 185,
                        "y": 442
                    },
                    {
                        "x": 149,
                        "y": 442
                    }
                ],
                "classes": [
                    {
                        "class": "hewlett_packard_enterprise",
                        "label": "Hewlett Packard Enterprise"
                    }
                ],
                "meta": {
                    "id": "2b725a2b-159c-11ee-9b4b-37b8298fd492",
                    "type": "logo",
                    "clarity": 0.4566395577699587,
                    "ocr_confirmed": false
                }
            }
        ],
        "has_logo": true
    }
}

Dominant color extraction

An ML model that extracts the most prominent colors in an image

Overview

The model analyzes the colors present in the input image and determines if they fall into several predefined ranges. It then compares the number of pixels in those ranges and outputs color names that were detected as visually dominant.

The pixels of the image are examined to identify the most prominent colors present. Multiple factors such as the distribution and frequency of different color values are taken into account to determine the dominant colors.

To find the color of a pixel, all values in the image are converted to an alternative color representation that is designed to be more closely aligned with the way human vision perceives colors. This enures high-quality results.

Use cases

Some typical use cases for this model include:

Visual search and filtering - Users can search for images based on specific dominant color criteria. For example, they can search for images dominated by a particular color to find assets that align with a specific theme or aesthetic.
Color-based categorization - The feature facilitates the automatic categorization of images based on their dominant colors. This allows for easier location and grouping of images with similar color schemes.
Branding Enhancement - Teams can utilize the dominant color information to ensure consistent branding and visual coherence. They can search for images that match their brand identity and integrate them into their campaigns.

API endpoints

An up-to-date reference with all API endpoints is available here:

Scaleflex API for Digital Asset Management (DAM), Visual AI and Media OptimizationScaleflex API

Example API responses

Product image

API response

{
    "status": "success",
    "version": "2.9.3",
    "sha1": "f87f1c2e708b1111aa8aa99640efec02aaeb130b",
    "request_uuid": "0e00fce5-19a5-4411-b161-6894f9a7ff8e",
    "file_downloaded": "grahame-jenkins-p7tai9P7H-s-unsplash.jpg",
    "dominant_colors": [
        {
            "color": "azure",
            "hex": "0980fb",
            "pixels": 402721,
            "ratio": 0.16,
            "all": 2453760
        }
    ],
    "main_colors": [
        "azure"
    ],
    "main_colors_ratio": [
        0.16
    ],
    "main_colors_pixels": [
        402721
    ],
    "main_colors_hex": [
        "0980fb"
    ]
}

{
    "status": "success",
    "version": "2.9.3",
    "sha1": "cb29b29a902db664cb3bcb83a593d5dc7e0ae06f",
    "request_uuid": "e7ca3fd3-5561-4b48-adf3-4df8120e48c3",
    "file_downloaded": "420799-MF-MELON_RED.jpg",
    "dominant_colors": [
        {
            "color": "pink",
            "hex": "ee6893",
            "pixels": 293000,
            "ratio": 0.05,
            "all": 6220800
        }
    ],
    "main_colors": [
        "pink"
    ],
    "main_colors_ratio": [
        0.05
    ],
    "main_colors_pixels": [
        293000
    ],
    "main_colors_hex": [
        "ee6893"
    ]
}

Faces

Face analysis

A set of ML models that accurately detect human faces and predict crucial characteristics like facial landmarks, expression, ethnicity, age, and gender

Face analysis

A set of ML models that accurately detect human faces and predict crucial characteristics like facial landmarks, expression, ethnicity, age, and gender

Overview

The Face analyzer consists of several cutting-edge ML models. Its primary function is to detect visible human faces in images and predict some facial characteristics that are deemed important.

Leveraging state-of-the-art deep learning algorithms and neural networks, the Analyzer accurately identifies and analyzes faces, extracting the following information for each face:

position in the image (bounding box);
facial landmarks (coordinates of points that map to specific facial structures on the face);
expression classification (happy, angry, sad, etc.);
ethnicity classification;
age estimation;
gender classification.

Use cases

Use cases for automatic facial analysis include:

Image tagging and organization - Automatic facial analysis enables users to easily categorize and index images based on expressions, ethnicities, age groups and genders. This streamlines content management, making it easier to locate specific images for various purposes.
Inclusive representation - The Face Analyzer can facilitate inclusive representation in media content. By analyzing facial ethnicities and genders, content creators can ensure diversity and cultural representation in their visual assets, thus promoting inclusivity.
Search Optimization - Automatic tagging and categorization of images based on facial characteristics allow users to find specific faces, expressions, or ethnicities with ease.

API endpoints

An up-to-date reference with all API endpoints is available here:

Scaleflex API for Digital Asset Management (DAM), Visual AI and Media OptimizationScaleflex API

Example API responses

Input image

Output preview

API response

{
    "status": "success",
    "version": "2.9.3",
    "request_uuid": "4f74f084-39a3-48f7-829a-336b55e8fd13",
    "result": "https://fdocs.filerobot.com/https://ask.filerobot.com/deliver/fdocs/4f74f084-39a3-48f7-829a-336b55e8fd13.png",
    "faces": {
        "face_0": {
            "box": [
                467,
                269,
                576,
                416
            ],
            "emotions": {
                "angry": 0.07,
                "disgust": 0.03,
                "fear": 0.02,
                "happy": 98.85,
                "sad": 0.3,
                "surprise": 0.58,
                "neutral": 0.16
            },
            "dominant_emotion": "happy",
            "races": {
                "asian": 0.89,
                "indian": 3.23,
                "black": 0.42,
                "white": 34.26,
                "middle eastern": 31.55,
                "latino hispanic": 29.64
            },
            "dominant_race": "white",
            "age": 34,
            "gender": "Male"
        },
        "face_1": {
            "box": [
                907,
                53,
                1017,
                207
            ],
            "emotions": {
                "angry": 1.92,
                "disgust": 0.13,
                "fear": 1.65,
                "happy": 31.76,
                "sad": 9.86,
                "surprise": 3.49,
                "neutral": 51.2
            },
            "dominant_emotion": "neutral",
            "races": {
                "asian": 0.01,
                "indian": 0.02,
                "black": 0.0,
                "white": 89.42,
                "middle eastern": 5.25,
                "latino hispanic": 5.3
            },
            "dominant_race": "white",
            "age": 25,
            "gender": "Male"
        },
        "face_2": {
            "box": [
                62,
                254,
                172,
                398
            ],
            "emotions": {
                "angry": 4.9,
                "disgust": 0.56,
                "fear": 0.16,
                "happy": 4.81,
                "sad": 22.33,
                "surprise": 0.13,
                "neutral": 67.1
            },
            "dominant_emotion": "neutral",
            "races": {
                "asian": 0.01,
                "indian": 0.0,
                "black": 0.0,
                "white": 99.67,
                "middle eastern": 0.12,
                "latino hispanic": 0.2
            },
            "dominant_race": "white",
            "age": 32,
            "gender": "Female"
        },
        "face_3": {
            "box": [
                746,
                346,
                844,
                474
            ],
            "emotions": {
                "angry": 0.55,
                "disgust": 0.29,
                "fear": 0.15,
                "happy": 97.94,
                "sad": 0.09,
                "surprise": 0.92,
                "neutral": 0.06
            },
            "dominant_emotion": "happy",
            "races": {
                "asian": 3.73,
                "indian": 9.26,
                "black": 1.63,
                "white": 27.46,
                "middle eastern": 26.19,
                "latino hispanic": 31.73
            },
            "dominant_race": "latino hispanic",
            "age": 36,
            "gender": "Male"
        },
        "face_4": {
            "box": [
                268,
                143,
                367,
                267
            ],
            "emotions": {
                "angry": 0.91,
                "disgust": 0.01,
                "fear": 0.11,
                "happy": 7.24,
                "sad": 16.02,
                "surprise": 0.69,
                "neutral": 75.03
            },
            "dominant_emotion": "neutral",
            "races": {
                "asian": 0.0,
                "indian": 0.0,
                "black": 0.0,
                "white": 99.97,
                "middle eastern": 0.02,
                "latino hispanic": 0.02
            },
            "dominant_race": "white",
            "age": 23,
            "gender": "Male"
        },
        "face_5": {
            "box": [
                1131,
                272,
                1225,
                400
            ],
            "emotions": {
                "angry": 0.04,
                "disgust": 3.63,
                "fear": 0.02,
                "happy": 95.44,
                "sad": 0.2,
                "surprise": 0.05,
                "neutral": 0.63
            },
            "dominant_emotion": "happy",
            "races": {
                "asian": 0.0,
                "indian": 0.0,
                "black": 0.0,
                "white": 99.88,
                "middle eastern": 0.07,
                "latino hispanic": 0.05
            },
            "dominant_race": "white",
            "age": 24,
            "gender": "Female"
        }
    },
    "file_downloaded": "friends.jpg"
}

{
    "status": "success",
    "version": "2.9.3",
    "request_uuid": "cf655b9a-7467-4d2c-8425-a12fbb600af4",
    "result": "https://fdocs.filerobot.com/https://ask.filerobot.com/deliver/fdocs/cf655b9a-7467-4d2c-8425-a12fbb600af4.png",
    "faces": {
        "face_0": {
            "box": [
                630,
                161,
                940,
                573
            ],
            "emotions": {
                "angry": 0.9,
                "disgust": 0.29,
                "fear": 0.09,
                "happy": 2.27,
                "sad": 3.59,
                "surprise": 0.09,
                "neutral": 92.79
            },
            "dominant_emotion": "neutral",
            "races": {
                "asian": 0.03,
                "indian": 0.06,
                "black": 0.0,
                "white": 84.59,
                "middle eastern": 7.62,
                "latino hispanic": 7.7
            },
            "dominant_race": "white",
            "age": 26,
            "gender": "Female"
        }
    },
    "file_downloaded": "face.png"
}

Face clustering

A system that detects human faces, extracts feature vectors and clusters similar faces to efficiently group images based on the individuals present in them

Overview

The Face Clustering ML model is an advanced component offered as a part of our service. With its powerful facial detection and feature extraction capabilities, this model fundamentally changes the way images are organized and grouped based on the individuals present in them.

The model accurately detects visible human faces within images, regardless of their positions or orientations. This crucial first step ensures that all relevant faces are identified for subsequent clustering.

For each detected face, the model extracts a high-dimensional feature vector, representing the unique facial characteristics and attributes of that individual. These feature vectors capture critical facial traits while discarding irrelevant information, making them ideal for face-based comparisons.

Leveraging advanced machine learning algorithms, the model clusters all faces (and the respective images they are part of) based on the similarity of their feature vectors. Faces with similar features are grouped, enabling the automatic identification of distinct individuals in image collections.

The face clustering model can categorize faces without the need for explicit training data on individual identities. This flexibility allows the model to adapt to various datasets and expand its clustering capabilities.

Use cases

Automatic face clustering can have a significant impact on the following use cases:

Event photography management - Event photographs often contain multiple different individuals. The model simplifies the organization of these images by automatically grouping them based on the distinct people present. This streamlines the curation and delivery of photo collections.
Content organization - Automatic face clustering can make it easier for users to navigate and discover images related to specific individuals.
Photo collections - In personal photo libraries, the face clustering model assists in creating dedicated albums or collections for family members and friends, thus saving users valuable time by automatically organizing photos by individual identities.

API endpoints

An up-to-date reference with all API endpoints is available here:

Scaleflex API for Digital Asset Management (DAM), Visual AI and Media OptimizationScaleflex API

Examples

Input images

Detected faces

Created clusters

Image quality

A ML model that evaluates the technical quality of images by analyzing their visual attributes

Overview

The quality of digital content is a critical factor in delivering satisfactory user experience and achieving branding goals. Our Image quality assessment (IQA) model is a machine learning solution designed to evaluate the visual quality of images.

When an image is fed into the system, the model processes it to assess various quality attributes and generates a quality score that reflects a weighted combination of those attributes, providing a precise assessment of the image's technical quality.

The attributes include multiple factors such as sharpness, blurriness, noise, dynamic range, contrast, distortion, etc. By providing a thorough image quality assessment, the model greatly enhances the management of digital assets and offers indispensable aid in content selection and optimization.

Typical use cases

Image quality assessment finds application in various domains and industries where image quality is a top priority:

Stock photography - Stock image providers can use the model to rate and categorize the images in their libraries, thus helping customers find ones with superior quality for their projects.
E-commerce - E-commerce platforms can ensure product images meet quality standards, improving the visual appeal of product listings and driving sales.
Publishing - Media outlets and publishing companies can automate the assessment of images for articles so their publications contain high-quality assets.
Advertising - Advertising agencies can use more easily evaluate the technical quality of campaign visuals, helping maintain brand consistency and professionalism.
Digital marketing - Marketers can optimize their social media and online content for greater impact and engagement.

API endpoints

Information about the specific API endpoints is available in an always up-to-date documentation, that can be accessed via the following link:

Scaleflex API for Digital Asset Management (DAM), Visual AI and Media OptimizationScaleflex API

There, you can find detailed information about the API endpoints, together with all required request parameters, so you know how to interact with them.

Example API responses

Input image

API response

{
  "status": "success",
  "version": "3.0.2",
  "quality_score": 0.17,
  "request_uuid": "ffbba05c-6231-4d6e-a8e6-6ae3e1b9cf27",
  "sha1": "4a12c83542e39c85eb412754c9d8a8deefd2822a"
}

{
  "status": "success",
  "version": "3.0.2",
  "quality_score": 0.84,
  "request_uuid": "84ced2cf-51fc-4792-a72f-ef74910d8e48",
  "sha1": "63bf86e6595db5ec2e7422d3941c6a72f729c48d"
}

OCR

A machine-learning algorithm that utilizes optical character recognition techniques to accurately identify and extract text from images or scanned documents

Overview

The OCR ML model is an integral component of our service, providing robust optical character recognition capabilities with support for multiple languages.

The model is designed to convert text present in images or scanned documents into editable and searchable data. It leverages the power of machine learning techniques to accurately recognize characters and words, enabling efficient text extraction and analysis.

It examines the input, tries to detect any text fragments present in the image and recognizes the characters in those fragments according to the specified language. Detected fragments with a good enough confidence level are returned as text strings.

Use cases

The OCR service can be useful for multiple use cases, including:

Text extraction and indexing - The model extracts text from images or scanned documents, enabling efficient indexing and searching of digital assets. Users can find images or documents based on specific keywords or phrases mentioned within the text content.
Document digitization - Important information can be preserved by converting physical documents into digital formats.
Language translation - Combined with language translation capabilities, OCR can facilitate multilingual asset management, enabling users to search and translate text in various languages.
Improved accessibility - Converting text within images or scanned documents into machine-readable format enhances accessibility for visually impaired individuals, enabling screen readers or assistive technologies to interpret the content.

API endpoints

An up-to-date reference with all API endpoints is available here:

Scaleflex API for Digital Asset Management (DAM), Visual AI and Media OptimizationScaleflex API

Example API responses

Input image

API response

{
    "status": "success",
    "version": "2.9.3",
    "request_uuid": "09e7f3ba-f8fc-4ce1-b263-a717989cb4fa",
    "sha1": "9aeafd9e6b9aeb065a24a996f1a0b8cd57b97a6c",
    "language": "en",
    "result": "Las Vegas 72 Salt Lake City 493"
}

{
  "status": "success",
  "version": "2.9.3",
  "request_uuid": "3ca21912-dc2a-4008-9894-e7a5ea116286",
  "sha1": "3d50832674417549ae53ec0a27c3bfea3ebd946b",
  "language": "pl",
  "result": "Przepraszam, czy dostanę te buty w rozmiarze 39? Tak, chciałaby Pani przymierzyć? Tak. Proszę. Dziękuję  Są chyba trochę za małe. Czy ma pan czterdziestkę? To jest czterdziestka. Te są dobre. Wezmę je. Ile kosztują?"
}

Number Plate recognition

An ML model that detects and recognizes vehicle registration plate numbers

Overview

The Number plate recognizer is a machine-learning model that detects and recognizes vehicle registration plate numbers in images.

It is developed and trained to detect vehicle license plates, recognize the characters on the plate, and return them as text strings. The model takes in an input image containing a vehicle and outputs the alphanumeric number present on the vehicle's registration plate in text form.

The process of license plate recognition consists of the following steps:

License plate detection;
Pre-processing the resulting image from Step 1 (warping, deskewing) to prepare it for optical character recognition (OCR);
Passing the image from Step 2 through an OCR engine and receiving the recognized characters.

Use cases

Some typical use cases for this model include:

Efficient vehicle cataloging - Vehicles can automatically be cataloged in a database. When new vehicle images are uploaded, the model can extract the registration plate numbers and associate them with the corresponding vehicle records, allowing for quick and accurate identification and retrieval of vehicle assets.
Smart searching - Users can leverage plate number recognition to perform targeted searches and easily locate specific vehicles, thus saving time and effort.
Automated metadata - Vehicle assets can be tagged with relevant metadata based on their registration plate numbers. This metadata may include vehicle make, model, year, and other information associated with the recognized plate.
Regulatory compliance - The model can aid in ensuring compliance with legal requirements related to vehicle registration and documentation.

API endpoints

An up-to-date reference with all API endpoints is available here:

Scaleflex API for Digital Asset Management (DAM), Visual AI and Media OptimizationScaleflex API

Example API responses

Product image

API response

{
    "status": "success",
    "version": "2.9.3",
    "result": {
        "request_uuid": "3404ab7f-b62c-4c1a-b9ad-942c109e5522",
        "created_on": "2023-06-27 14:41:59.837681",
        "sha1": "ec0d6903226504ff24aac35f8d303a06dabbfc6e",
        "file_path": "/app/app/files/2a1da70c8e74f4155bf5ff93/sample1.jpg",
        "plate_number": [
            "B 44 VNN"
        ]
    }
}

{
    "status": "success",
    "version": "2.9.3",
    "result": {
        "request_uuid": "87594660-f3cf-4ff1-b266-198059100f4d",
        "created_on": "2023-06-27 14:43:17.740529",
        "sha1": "bd593aa23a75df1f4d4fcd63f17d38fb2c71ef17",
        "file_path": "/app/app/files/23baa18c814b0ba18c02da80/test.jpg",
        "plate_number": [
            "IN Q8038",
            "IN R8103"
        ]
    }
}

Product type

An ML model classifying an image as pure product image or application image, showing the product in use

Overview

The Product type classifier is a machine learning model, specifically designed to classify images into two categories:

Pure product images (packshot) typically show the product by itself in a clear and uncluttered way. The goal in this case is to focus on and highlight the product features and design.
Product application Images that show the product in use or context, often in a real-world setting. These images are designed to illustrate how the product can be operated, its potential benefits, and how it fits into a larger system or environment.

Product image

Application image

The model is particularly useful in the context of online stores and businesses dealing with large collections of product images. By leveraging its power, businesses can streamline their digital asset management workflows, organize image libraries, and deliver personalized user experiences.

Typical use cases

Some typical use cases include:

Image library organization - Online stores often have vast libraries of product images that need to be effectively organized and categorized. The classifier can automatically sort images based on their content, facilitating easy retrieval and management.
Search enhancement - The model allows for more accurate search and filtering capabilities. Users can easily find images that showcase the product itself or images that demonstrate its usage in real-world contexts.
Content Curation - The classifier aids in curating image galleries and content showcases by providing an automated classification of images. It aids in showcasing product images in relevant contexts, ensuring a coherent and focused presentation.
Product Recommendations - By understanding whether an image focuses on the product or its application, businesses can enhance their recommendation algorithms. The classifier can be used to tailor recommendations based on the user's preference for pure product images or application images.

API endpoints

An up-to-date reference with all API endpoints is available here:

Scaleflex API for Digital Asset Management (DAM), Visual AI and Media OptimizationScaleflex API

Example API responses

Product image

API response

{
    "status": "success",
    "version": "2.9.0",
    "predictions": [
        {
            "application": 0.07,
            "product": 0.93
        }
    ],
    "request_uuid": "9f894a00-a101-402c-b556-3f274e2ab0e6",
    "sha1": "57f15ae25441e3ad8362ff50eb2a8763cd6fd809"
}

Application image

API response

{
    "status": "success",
    "version": "2.9.0",
    "predictions": [
        {
            "application": 1.0,
            "product": 0.0
        }
    ],
    "request_uuid": "32edac8a-d745-4a9f-825a-9fa9d74547e9",
    "sha1": "4d57f707f32b10b4cbb4f4d5f83451f5d2e6aa45"
}

Property classification

Precise classification of property-related images across various industries.

Overview

Efficient categorization of digital content is essential for organizing and retrieving assets effectively. The Property classification model is an advanced machine learning (ML) solution designed to identify and classify the type of real estate property depicted within an image.

The model analyzes the input image and determines if the displayed realty falls into one of the following property classes:

Apartment
Bathroom
Bedroom
Dining
Garage
House
Industrial
Kitchen
Living
Office

The Property classification feature is based on state-of-the-art computer vision techniques and deep learning algorithms that interpret the image data and identify patterns that are indicative of the property type depicted. This allows the model to accurately predict the predefined category that the image is part of, enabling precise classification. By doing this, this model significantly enhances the management of digital assets, streamlining content organization and retrieval for various industries.

Typical use cases

Property classification finds utility in a variety of use cases, all aimed at improving digital asset management and content organization:

Real estate listings - Real estate agencies can use the model to automatically classify and organize property images for listing websites, making it easier for potential buyers or renters to find properties of interest.
Property management - Property management companies can organize images of rental properties, simplifying property management and tenant communication.
Interior design - Interior design platforms can categorize images of furniture and home decoration products, thus enhancing the online shopping experience for customers.
Hospitality industry - The model can help hotels and hospitality businesses sort images of their rooms, facilities and dining areas for marketing and reservation platforms.
E-commerce product classification - E-commerce platforms can use the model to automatically categorize products based on the type of setting they are intended for (e.g., kitchen appliances, bedroom furniture).

API endpoints

Information about the specific API endpoints is available in an always up-to-date documentation, that can be accessed via the following link:

Scaleflex API for Digital Asset Management (DAM), Visual AI and Media OptimizationScaleflex API

There, you can find detailed information about the API endpoints, together with all required request parameters, so you know how to interact with them.

Example API responses

Input image

API response

{
  "status": "success",
  "version": "3.0.2",
  "predictions": [
    {
      "bedroom": 1,
      "apartment": 0,
      "kitchen": 0
    }
  ],
  "request_uuid": "a044541f-d996-4231-8a06-f68754b017a4",
  "sha1": "c8f52a7c740d48bc1fa428655ff2289940615c3e"
}

{
  "status": "success",
  "version": "3.0.2",
  "predictions": [
    {
      "apartment": 0.91,
      "office": 0.07,
      "dinner": 0.01
    }
  ],
  "request_uuid": "6aa97ee2-02cc-40cc-a228-78132855c88a",
  "sha1": "aa98fea5e0cd0aec274c07ec5bb7633b1d37c227"
}

{
  "status": "success",
  "version": "3.0.2",
  "predictions": [
    {
      "kitchen": 1,
      "living": 0,
      "bathroom": 0
    }
  ],
  "request_uuid": "520fc7e2-ee42-4961-9f22-0774bac40ea2",
  "sha1": "a5b0c6c8e481c4eb9410d5790a3e6ed68d28b525"
}

Scene Classification

An ML model that accurately classifies images, offering appropriate scene categories and attributes

Overview

The Scene classifier is a cutting-edge artificial intelligence system integrated into our service. Its primary function is to analyze and classify images based on their environmental context and distinguish between indoor and outdoor scenes. Leveraging the power of deep learning and neural networks, this model can accurately determine the setting of an image and provide insights about various scene categories, that the image has been determined to be part of.

The model offers a rich set of scene categories and attributes to help users understand the context of their images better. Some of the scene categories include landscapes, beaches, mountains, forests, interiors, exteriors, etc. It can also identify specific attributes like natural lighting, artificial lighting, crowded spaces, open spaces, urban settings, rural settings and more.

Typical Use Cases

Scene classification can be appropriate in the following use cases:

Content organization - The Scene classifier plays a vital role in automatically organizing large collections of images. By accurately determining whether an image is taken indoors or outdoors, users can easily filter and group their assets based on environmental context, streamlining content management and retrieval.
Marketing campaigns - Marketing teams can sort and analyze images that are relevant to specific campaigns. For instance, for an outdoor adventure campaign, the system can help select images of landscapes, mountains, and beaches, thus ensuring consistent and fitting visuals.
Search optimization - Scene classification adds a new layer of metadata to the images. This enhances search capabilities, enabling efficient searching for specific scenes or environments.

API endpoints

An up-to-date reference with all API endpoints is available here:

Scaleflex API for Digital Asset Management (DAM), Visual AI and Media OptimizationScaleflex API

Example API responses

Input image

API response

{
    "status": "success",
    "version": "2.9.3",
    "predictions": {
        "environment": "indoor",
        "scene_categories": [
            {
                "category": "arena/performance",
                "probability": 0.902
            },
            {
                "category": "stage/indoor",
                "probability": 0.063
            },
            {
                "category": "discotheque",
                "probability": 0.013
            },
            {
                "category": "orchestra_pit",
                "probability": 0.009
            },
            {
                "category": "stage/outdoor",
                "probability": 0.005
            }
        ],
        "scene_attributes": [
            "no horizon",
            "indoor lighting",
            "man-made",
            "enclosed area",
            "spectating",
            "congregating",
            "glossy",
            "cloth",
            "socializing"
        ]
    },
    "request_uuid": "026c192d-061e-4db6-9723-fa56fb73778b",
    "sha1": "2975abaf7208621839a1c41a09cc714b2e32498b"
}

{
    "status": "success",
    "version": "2.9.3",
    "predictions": {
        "environment": "outdoor",
        "scene_categories": [
            {
                "category": "lagoon",
                "probability": 0.389
            },
            {
                "category": "coast",
                "probability": 0.136
            },
            {
                "category": "beach",
                "probability": 0.122
            },
            {
                "category": "ocean",
                "probability": 0.082
            },
            {
                "category": "beach_house",
                "probability": 0.075
            }
        ],
        "scene_attributes": [
            "natural light",
            "open area",
            "sunny",
            "far-away horizon",
            "natural",
            "swimming",
            "boating",
            "diving",
            "clouds"
        ]
    },
    "request_uuid": "be1cc139-cad4-4ea2-958a-6f6ecd201899",
    "sha1": "7ff3940744ebdba29fce91b8b5e3648da2d258a2"
}

Sport Classification

An ML model that identifies sport activities depicted in images, empowering diverse industries with precise content organization and targeted applications

Overview

The Sport classifier is a state-of-the-art artificial intelligence service. Its primary function is to analyze and classify images based on the sports activities they depict. Utilizing cutting-edge deep learning techniques and neural networks, this model can accurately identify various sports and provide valuable insights for content organization and a wide range of applications.

The model offers a diverse set of sports categories, including but not limited to football, basketball, tennis, swimming, gymnastics, baseball and many more. It ensures precise tagging and categorization of sports-related images, making it an indispensable tool for media agencies and content managers.

Typical Use Cases

Sports classification can be applied in multiple domains, including:

Sports media coverage - By quickly identifying the sport depicted in an image, journalists and editors can efficiently organize their image archives and provide real-time coverage during sporting events, enhancing storytelling and enriching their publications.
Sports analytics - The model can aid in automating the process of collecting data by categorizing images from games and training sessions.
Sports product marketing - Sports brands and retailers can utilize the model to optimize their marketing efforts. Sorting images of athletes engaging in specific sports can aid in the creation of targeted campaigns.
Search enhancement - Precise sport categorization can optimize the process of searching and filtering assets, enabling users to find relevant images for their projects with ease.

API endpoints

An up-to-date reference with all API endpoints is available here:

Scaleflex API for Digital Asset Management (DAM), Visual AI and Media OptimizationScaleflex API

Example API responses

Input image

API response

{
    "status": "success",
    "version": "2.9.3",
    "sport": {
        "info": {
            "name": "Weightlifting",
            "confidence": 0.94
        },
        "detected": true
    },
    "request_uuid": "883f7bcf-ca34-4d34-b5c4-63ebcdf459a8",
    "sha1": "425ed85234fc7d9419b7c37f3cf7fca1679f0832"
}

{
    "status": "success",
    "version": "2.9.3",
    "sport": {
        "info": {
            "name": "Basketball",
            "confidence": 0.97
        },
        "detected": true
    },
    "request_uuid": "894ff1b7-6d8d-4ac2-a3ad-3988a48bf99d",
    "sha1": "fdc5cdb6e033e27fe4b0267afbce23b287811e8c"
}