Initial data

When you create a new warehouse in Clarifeye, the uploaded data is transformed automatically. To store the results, Clarifeye creates three tables:
  • Parsed Documents Table - The raw result of the parsing.
  • Blocks Table - Atomic data extracted from the documents.
  • Chunks Table - Chunks of the documents.
Clarifeye also creates a dedicated viewer for each of these tables so that you can explore the data.

Overriding the default behaviors

You can change the default parameters for each of these tasks.

Changing the parameters in the warehouse settings

For your warehouse, you can customize:
  • Parameters of the parsing (in warehouse settings > parsing)
  • Parameters of the chunking (in extractor settings > chunks extractor)

Create your own pipelines

You can also use the Clarifeye Python client to create your own pipelines. Two APIs are available for this:
  • Retrieval API (to get data from a table)
  • Write Data API (to write data to a table)
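
The read-then-write shape of such a pipeline can be sketched as follows. This is only an illustration under stated assumptions: the method names `read_rows` and `write_rows`, the table names, and the row fields (`id`, `text`) are hypothetical and are not taken from the Clarifeye client, so check the Retrieval API and Write Data API references for the real calls. An in-memory stand-in replaces the client so that the example runs without a warehouse.

```python
from __future__ import annotations

from typing import Iterable, Protocol


class TableClient(Protocol):
    """Hypothetical interface; the real Clarifeye client may look different."""

    def read_rows(self, table: str) -> Iterable[dict]: ...

    def write_rows(self, table: str, rows: list[dict]) -> None: ...


class InMemoryClient:
    """Stand-in for the real client, backed by a plain dictionary of tables."""

    def __init__(self, tables: dict[str, list[dict]]) -> None:
        self.tables = tables

    def read_rows(self, table: str) -> Iterable[dict]:
        # Return a copy so callers cannot mutate the stored rows by accident.
        return list(self.tables.get(table, []))

    def write_rows(self, table: str, rows: list[dict]) -> None:
        # Append to the target table, creating it if it does not exist yet.
        self.tables.setdefault(table, []).extend(rows)


def run_pipeline(client: TableClient, source: str, target: str) -> int:
    """Read rows from `source`, derive one row per input, write them to `target`."""
    derived = [
        {"chunk_id": row["id"], "word_count": len(row["text"].split())}
        for row in client.read_rows(source)
    ]
    client.write_rows(target, derived)
    return len(derived)


# Hypothetical table names and row fields, used for illustration only.
client = InMemoryClient(
    {"chunks": [{"id": 1, "text": "hello world"}, {"id": 2, "text": "one"}]}
)
written = run_pipeline(client, source="chunks", target="chunk_stats")
```

Keeping the pipeline logic behind a small interface like this means the transformation can be tested locally and then pointed at the real client once the actual method names are known.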