# Chonkie

## Docs

- [Concepts](https://docs.chonkie.ai/common/concepts.md): Common concepts of Chonkie
- [Open Source](https://docs.chonkie.ai/common/open-source.md): The Open Source Library For RAG
- [🦛 Chonkie ✨](https://docs.chonkie.ai/common/welcome.md): The lightweight ingestion library for fast, efficient and robust RAG pipelines
- [Docker](https://docs.chonkie.ai/oss/api/docker.md): Deploy the Chonkie API server with Docker and docker-compose
- [Endpoints](https://docs.chonkie.ai/oss/api/endpoints.md): API reference for all Chonkie chunkers and refineries
- [API Server](https://docs.chonkie.ai/oss/api/overview.md): Self-host Chonkie as a REST API for language-agnostic text chunking
- [Pipelines](https://docs.chonkie.ai/oss/api/pipelines.md): Store and manage reusable chunking pipeline configurations
- [Quick Start](https://docs.chonkie.ai/oss/api/quickstart.md): Get the Chonkie API server running in under a minute
- [Changelog](https://docs.chonkie.ai/oss/changelog.md): Chonkie's Release Notes and Updates 🦛✨
- [MarkdownChef](https://docs.chonkie.ai/oss/chefs/markdownchef.md): Process markdown files, extracting tables, code blocks, and images.
- [Chefs Overview](https://docs.chonkie.ai/oss/chefs/overview.md): Overview of the different chefs available in Chonkie
- [TableChef](https://docs.chonkie.ai/oss/chefs/tablechef.md): Extract tables from markdown text (including HTML tables) and prepare them for future usage.
- [TextChef](https://docs.chonkie.ai/oss/chefs/textchef.md): Process plain text files into Document objects.
- [Code Chunker](https://docs.chonkie.ai/oss/chunkers/code-chunker.md): Split code into chunks based on code structure
- [Fast Chunker](https://docs.chonkie.ai/oss/chunkers/fast-chunker.md): SIMD-accelerated text chunking at 100+ GB/s throughput
- [Late Chunker](https://docs.chonkie.ai/oss/chunkers/late-chunker.md): Split text into chunks based on a late-bound token count
- [Neural Chunker](https://docs.chonkie.ai/oss/chunkers/neural-chunker.md): Split text using a fine-tuned BERT model to detect semantic shifts
- [Chunkers Overview](https://docs.chonkie.ai/oss/chunkers/overview.md): Overview of the different chunkers available in Chonkie
- [Recursive Chunker](https://docs.chonkie.ai/oss/chunkers/recursive-chunker.md): Recursively chunk documents into smaller chunks.
- [SDPM Chunker (Legacy)](https://docs.chonkie.ai/oss/chunkers/sdpm-chunker.md): Semantic Double-Pass Merging chunker - now integrated into SemanticChunker
- [Semantic Chunker](https://docs.chonkie.ai/oss/chunkers/semantic-chunker.md): Split text into chunks based on semantic similarity with advanced features
- [Sentence Chunker](https://docs.chonkie.ai/oss/chunkers/sentence-chunker.md): Split text into chunks while preserving sentence boundaries
- [Slumber Chunker](https://docs.chonkie.ai/oss/chunkers/slumber-chunker.md): Agentic chunking powered by generative models via the Genie interface
- [Table Chunker](https://docs.chonkie.ai/oss/chunkers/table-chunker.md): Split markdown or HTML tables into manageable chunks by row, preserving headers.
- [TeraflopAI Chunker](https://docs.chonkie.ai/oss/chunkers/teraflopai-chunker.md): Segment text using the TeraflopAI Segmentation API
- [Token Chunker](https://docs.chonkie.ai/oss/chunkers/token-chunker.md): Split text into fixed-size token chunks with configurable overlap
- [AutoEmbeddings](https://docs.chonkie.ai/oss/embeddings/auto-embeddings.md): Automatically select the best embeddings handler for your use case
- [AzureOpenAIEmbeddings](https://docs.chonkie.ai/oss/embeddings/azure-embeddings.md): Embed text using Azure OpenAI embeddings
- [CohereEmbeddings](https://docs.chonkie.ai/oss/embeddings/cohere-embeddings.md): Embed text using Cohere embeddings
- [Create your own custom embeddings handler](https://docs.chonkie.ai/oss/embeddings/custom-embeddings.md)
- [GeminiEmbeddings](https://docs.chonkie.ai/oss/embeddings/gemini-embeddings.md): Embed text using Google Gemini embeddings via GenAI API
- [JinaEmbeddings](https://docs.chonkie.ai/oss/embeddings/jina-embeddings.md): JinaEmbeddings is a utility for embedding chunks.
- [Model2VecEmbeddings](https://docs.chonkie.ai/oss/embeddings/model2vec-embeddings.md): Embed text using Model2Vec embeddings
- [OpenAIEmbeddings](https://docs.chonkie.ai/oss/embeddings/openai-embeddings.md): Embed text using OpenAI embeddings
- [Embeddings Overview](https://docs.chonkie.ai/oss/embeddings/overview.md): Overview of the different embeddings available in Chonkie
- [SentenceTransformerEmbeddings](https://docs.chonkie.ai/oss/embeddings/sentence-transformer-embeddings.md): Embed text using SentenceTransformer embedding models
- [VoyageAIEmbeddings](https://docs.chonkie.ai/oss/embeddings/voyageai-embeddings.md): Embed text using VoyageAI embeddings
- [CLI](https://docs.chonkie.ai/oss/experimental/chonkie-cli.md): Chonkie Command Line Interface
- [Code Chunker](https://docs.chonkie.ai/oss/experimental/code-chunker.md): Advanced AST-based code chunking with intelligent semantic preservation
- [Overview](https://docs.chonkie.ai/oss/experimental/overview.md): Explore cutting-edge chunking capabilities with Chonkie's experimental features
- [FileFetcher](https://docs.chonkie.ai/oss/fetchers/file-fetcher.md): Fetch files from local filesystem for pipeline processing
- [Fetchers Overview](https://docs.chonkie.ai/oss/fetchers/overview.md): Overview of the different fetchers available in Chonkie
- [Chroma Handshake](https://docs.chonkie.ai/oss/handshakes/chroma-handshake.md): Export Chonkie's Chunks into a Chroma collection.
- [Elasticsearch Handshake](https://docs.chonkie.ai/oss/handshakes/elastic-handshake.md): Export Chonkie's Chunks into an Elasticsearch index.
- [LanceDB Handshake](https://docs.chonkie.ai/oss/handshakes/lancedb-handshake.md): Export Chonkie's Chunks into a LanceDB table.
- [Milvus Handshake](https://docs.chonkie.ai/oss/handshakes/milvus-handshake.md): Export Chonkie's Chunks into a Milvus collection.
- [MongoDB Handshake](https://docs.chonkie.ai/oss/handshakes/mongodb-handshake.md): Export Chonkie's Chunks into a MongoDB collection.
- [Handshakes Overview](https://docs.chonkie.ai/oss/handshakes/overview.md): Overview of the different handshakes available in Chonkie
- [Pgvector Handshake](https://docs.chonkie.ai/oss/handshakes/pgvector-handshake.md): Export Chonkie's Chunks into a PostgreSQL database with pgvector.
- [Pinecone Handshake](https://docs.chonkie.ai/oss/handshakes/pinecone-handshake.md): Export Chonkie's Chunks into a Pinecone index.
- [Qdrant Handshake](https://docs.chonkie.ai/oss/handshakes/qdrant-handshake.md): Export Chonkie's Chunks into a Qdrant collection.
- [Turbopuffer Handshake](https://docs.chonkie.ai/oss/handshakes/turbopuffer-handshake.md): Export Chonkie's Chunks into a Turbopuffer database.
- [Weaviate Handshake](https://docs.chonkie.ai/oss/handshakes/weaviate-handshake.md): Export Chonkie's Chunks into a Weaviate collection.
- [Installation](https://docs.chonkie.ai/oss/installation.md): Installing Chonkie and its various components
- [Building Pipelines](https://docs.chonkie.ai/oss/pipelines.md): Build powerful text processing workflows with Chonkie's Pipeline API
- [DatasetsPorter](https://docs.chonkie.ai/oss/porters/datasets-porter.md): Export Chonkie's Chunks into a Hugging Face Dataset.
- [JSONPorter](https://docs.chonkie.ai/oss/porters/json-porter.md): Export Chonkie's Chunks into a JSON file.
- [Porters Overview](https://docs.chonkie.ai/oss/porters/overview.md): Overview of the different porters available in Chonkie
- [Get Started with Chonkie](https://docs.chonkie.ai/oss/quick-start.md): Get started with Chonkie
- [Embeddings Refinery](https://docs.chonkie.ai/oss/refinery/embeddings-refinery.md): Embed Chunked Texts
- [Overlap Refinery](https://docs.chonkie.ai/oss/refinery/overlap-refinery.md): Refine chunks by adding overlapping context from adjacent chunks.
- [Refinery Overview](https://docs.chonkie.ai/oss/refinery/overview.md): Overview of the different refineries available in Chonkie
- [Hubbie](https://docs.chonkie.ai/oss/utils/hubbie.md): Hubbie is a utility for accessing Chonkie's saved recipes.
- [Logging](https://docs.chonkie.ai/oss/utils/logging.md): Control Chonkie's log output
- [Visualizer](https://docs.chonkie.ai/oss/utils/visualizer.md): Visualize your chunks and embeddings

## Optional

- [Discord](https://discord.gg/Q6zkP8w6ur)
- [Contact](mailto:support@chonkie.ai)