Defaults

Initial data

When you create a new warehouse in Clarifeye, some data transformation is automatically performed on the uploaded data. To store that data, Clarifeye will create 3 tables:

Parsed Documents Table - The raw result of the parsing.
Blocks Table - Atomic data extracted from the documents
Chunks Table - Chunks of the documents

For each of these tables, a dedicated viewer will be created to allow you to explore the data.

Overriding the default behaviors

You can change the default parameters for each of these tasks.

Changing the parameters in the warehouse settings

You can customize for you warehouse:

Parameters of the parsing (in warehouse settings > parsing)
Parameters of the chunking (in extractor settings > chunks extractor)

Create your own pipelines

You can also leverage Clarifeye python client to create your own pipelines. To do so you will be able to use the:

retrieval API (to get data from a table)
Write Data API (to write data to a table)

Overview

Python Client

REST API

Initial data

Overriding the default behaviors

Changing the parameters in the warehouse settings

Create your own pipelines

Overview

Python Client

REST API

Documentation Index

​Initial data

​Overriding the default behaviors

​Changing the parameters in the warehouse settings

​Create your own pipelines

Initial data

Overriding the default behaviors

Changing the parameters in the warehouse settings

Create your own pipelines