TeraflopAIChunker uses the TeraflopAI Segmentation API to split text into semantically meaningful segments. It is especially useful for domain-specific segmentation such as legal documents.
Installation
TeraflopAIChunker requires the teraflopai Python package.
For general installation instructions, see the Installation Guide.
Initialization
Parameters
- An existing TeraflopAI client instance. If provided, url and api_key are ignored.
- url: The URL for the TeraflopAI segmentation API endpoint.
- api_key: The API key for authentication. If not provided, it will be read from the TERAFLOPAI_API_KEY environment variable.
- The tokenizer used to compute token counts for returned chunks.
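The API key fallback described above can be pictured with a minimal sketch. The helper name resolve_api_key is hypothetical and not part of Chonkie or the teraflopai package; raising an error when no key is found is also an assumption, since the source only says the environment variable is read.

```python
import os
from typing import Optional

# Environment variable name taken from the parameter description above.
ENV_VAR = "TERAFLOPAI_API_KEY"


def resolve_api_key(api_key: Optional[str] = None) -> str:
    """Hypothetical helper: prefer an explicit key, else read the environment."""
    key = api_key or os.environ.get(ENV_VAR)
    if not key:
        # Assumption: a missing key is treated as a configuration error.
        raise ValueError(f"No API key given and {ENV_VAR} is not set.")
    return key
```

With this pattern, exporting TERAFLOPAI_API_KEY in the shell before starting Python lets the chunker be created without passing the key in code.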
Usage
Single Text Chunking
Batch Chunking
Using with Environment Variable
How It Works
- The text is sent to the TeraflopAI Segmentation API endpoint.
- The API returns a list of text segments.
- Each segment is converted into a Chonkie Chunk object with proper start_index, end_index, and token_count fields.
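The last step above can be sketched as follows. This is an illustration only, not the library's actual code: the Chunk dataclass is a simplified stand-in for Chonkie's Chunk, segments_to_chunks is a hypothetical helper, and counting characters as tokens is an assumed default in place of the configured tokenizer.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Chunk:
    """Simplified stand-in for Chonkie's Chunk (assumption, not the real class)."""
    text: str
    start_index: int
    end_index: int
    token_count: int


def segments_to_chunks(
    text: str,
    segments: List[str],
    count_tokens: Callable[[str], int] = len,  # assumed default: character count
) -> List[Chunk]:
    """Locate each returned segment in the original text and record its offsets."""
    chunks: List[Chunk] = []
    cursor = 0
    for segment in segments:
        # Search forward from the previous segment so repeated text maps correctly.
        start = text.find(segment, cursor)
        if start == -1:
            # Segment not found verbatim (e.g. whitespace was normalized):
            # fall back to the current cursor position.
            start = cursor
        end = start + len(segment)
        chunks.append(Chunk(segment, start, end, count_tokens(segment)))
        cursor = end
    return chunks
```

Searching from a moving cursor rather than from the start of the text keeps the offsets monotonic, so slicing the original text with start_index and end_index gives back each segment.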
The TeraflopAI Segmentation API performs the segmentation on the server side.
This chunker requires an active internet connection and a valid API key.
