Supported File Types

File types supported by Curiosity

Curiosity indexes a wide range of file types so that users can search and interact with their contents. Developers can customize which file types are indexed and searchable within their Curiosity workspace.

Overview of supported file types

Documents

  • Word Processing: doc, docx, odt, rtf, Google Docs, ott, odm, dot, dotx, docm, dotm, odf, afpub

  • Spreadsheets: xls, xlsx, ods, Google Sheets, xlt, csv, tsv, xlsm, xltm, ots

  • Presentations: ppt, pptx, odp, otp, Google Slides, ppsx, potx, ppam, ppsm, pptm, potm, pot, pps, ppa

  • PDFs: pdf, eps, xps

  • Ebooks: epub, mobi

  • Text Files: txt, md, log, json, xml, vcard

  • OneNote: one, onetoc2, onepkg

Emails, Messages, Calendars

  • Emails: msg, eml

  • Outlook Archives: pst

  • Calendars: ics, Google Calendar events

  • Messaging Platforms: Slack messages, Teams messages

Attachments within emails and messages are also indexed and searchable.

Images and Graphics

  • Images: png, jpg, jpeg, gif, tif, tiff, bmp, dng, webp, raw, heic, heif, psb, vg, odg, otg, odi

  • Diagrams: vsd

  • Drawings and Design Files: dwg, dxf, vsd, dgn, indd, stl, odc, Google Drawing, ai, xmind

  • Adobe Photoshop: psd

  • Figma: fig

  • Affinity Designer: afdesign, aftemplate, afphoto

Diagrams and drawings are searchable by file name only.

Audio and Video

  • Video Files: mp4, wmv, mpeg, avi, mkv, mov, ogv, 3gp, m4a, oga, weba, webm, flv

  • Audio Files: mp3, wav, mka, wma, flac, aac, aiff

Code and Development Files

  • Code Files: ps, cs, fs, css, js, class, java, c, cpp, h, php, py, sh, bat, swift, vb

Code files are currently searchable by file name only.

Other File Types

  • Webpages: html, htm

  • Archives: zip, rar, 7z, ace, gz, bz, bz2, tar, cab, onepkg

Archive files are unpacked during processing, so the files they contain are processed as well.
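
The distinction between file types searchable by content and those searchable by file name only can be sketched as a small lookup. This is an illustrative helper, not part of any Curiosity API: the function name `search_scope`, the set names, and the returned labels are hypothetical. The name-only set follows the diagram, drawing, and code extensions listed above; the content-indexed set is only a partial sample, on the assumption that the remaining listed categories are indexed by content.

```python
from pathlib import PurePath

# Extensions the overview lists under diagrams, drawings and design files,
# and code files, which it describes as searchable by file name only.
NAME_ONLY = {
    "vsd", "dwg", "dxf", "dgn", "indd", "stl", "odc", "ai", "xmind",
    "ps", "cs", "fs", "css", "js", "class", "java", "c", "cpp", "h",
    "php", "py", "sh", "bat", "swift", "vb",
}

# Partial sample of extensions from the other categories (assumption:
# these are searchable by name and content).
CONTENT_INDEXED = {
    "docx", "xlsx", "pptx", "pdf", "txt", "md", "eml", "msg",
    "png", "jpg", "mp3", "mp4", "html", "zip",
}


def search_scope(filename: str) -> str:
    """Return a hypothetical label describing how a file would be searchable."""
    # Take the last extension, drop the dot, and compare case-insensitively.
    ext = PurePath(filename).suffix.lstrip(".").lower()
    if ext in NAME_ONLY:
        return "name-only"
    if ext in CONTENT_INDEXED:
        return "name-and-content"
    return "unlisted"
```

For example, `search_scope("main.py")` falls into the name-only group, while an extension absent from both sets is reported as unlisted rather than guessed.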

Text in images and scans is extracted using the built-in optical character recognition (OCR) model.

Audio and video files are currently searchable by name and content using the built-in speech to text (STT) model.

Customizing Indexed File Types

To adjust which file types are indexed for search in your workspace, refer to our guide on Search.

Feedback and Suggestions

Missing a file type? Let us know on our public roadmap. Your feedback helps us improve and expand our file type support.

Last updated 1 year ago