Image Search

How to configure Image Search in a Curiosity Workspace

What is OCR?

OCR, or Optical Character Recognition, is a technology that converts text in scanned paper documents, PDF files, or images captured by a digital camera into machine-readable text.

OCR Support in Curiosity

Curiosity Workspaces include OCR capabilities so that text contained in images and scans can be indexed and found through search. This includes:

  • Image Documents: Curiosity can process image files such as JPEG, PNG, TIFF, and BMP, extracting their text for indexing and searching.

  • Scanned Documents: Curiosity can extract text from scanned documents, such as image-only PDFs or scanned images.

  • Multi-Language Support: Curiosity can recognize text in a range of languages, including English, French, Spanish, German, and Portuguese.

Which file types are supported with OCR?

Curiosity supports OCR for the following file types:

  • Images (.png, .jpg, .jpeg, .gif, .tif, .tiff, .bmp, .dng, .webp, .raw, .heic, .heif, .psb, .svg, .odg, .otg, .odi)

  • PDF scans (.pdf files whose content consists only of images)
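If you prepare files before uploading them, the list above can be turned into a quick client-side check. The following is a minimal sketch, not part of Curiosity itself: the names `OCR_IMAGE_EXTENSIONS` and `ocr_category` are hypothetical, and the extension set is simply copied from the list above. A `.pdf` file is only reported as a candidate, because the extension alone cannot tell whether the PDF contains only images.

```python
from pathlib import Path
from typing import Optional

# Hypothetical helper: extensions copied from the supported-file-types list above.
OCR_IMAGE_EXTENSIONS = {
    ".png", ".jpg", ".jpeg", ".gif", ".tif", ".tiff", ".bmp", ".dng", ".webp",
    ".raw", ".heic", ".heif", ".psb", ".svg", ".odg", ".otg", ".odi",
}


def ocr_category(filename: str) -> Optional[str]:
    """Classify a file name by its extension (case-insensitive).

    Returns "image" for listed image types, "pdf-candidate" for .pdf files
    (OCR applies only if the PDF content is image-only, which this check
    cannot determine), and None for everything else.
    """
    suffix = Path(filename).suffix.lower()
    if suffix in OCR_IMAGE_EXTENSIONS:
        return "image"
    if suffix == ".pdf":
        return "pdf-candidate"
    return None


if __name__ == "__main__":
    for name in ["scan.TIFF", "report.pdf", "notes.docx"]:
        print(name, "->", ocr_category(name))
```

This check only mirrors the documented extension list; whether a given file is actually processed with OCR is decided by the workspace.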

Configuring OCR in a Curiosity Workspace

Documentation coming soon...


Last updated 10 months ago