The no-nonsense RAG chunking library that’s lightweight, fast, and ready to CHONK your texts!
Ever found yourself making a RAG pipeline yet again (your 2,342,148th one), only to realize you’re stuck having to write chunking with bloated software library X or the painfully feature-less library Y? WHY CAN’T THIS JUST BE SIMPLE, UGH?
Well, look no further than Chonkie! (chonkie boi is a gud boi 🦛)
All the CHONKs you’d ever need for your RAG applications
Install, Import, CHONK - it’s that simple!
CHONK at the speed of light! zooooom
Supports all your favorite tokenizer, model and API CHONKs
No bloat, just CHONK - only 9.7MB base installation
psst it’s a pygmy hippo btw! Moto Moto approved
Get started with Chonkie in three simple steps: Install, Import and CHONK!
Want more features? :
Chonkie follows a special approach to dependencies, keeping the base installation lightweight while allowing you to add extra features as and when needed. Please check the Installation page for more details.
Release the CHONK! 🦛✨
Don’t wanna chunk locally? No problem! Chonkie Cloud is here to save the day!
Ready to learn more about Chonkie?
Learn about Chonkie’s core concepts and values
Learn about different installation options
Explore different chunking strategies
Chonkie’s hosted chunking service! 🦛☁️
Explore different embedding strategies
Star us on GitHub and contribute
Got questions? We’re here to help!
Join our Discord community for support from Chonkers all over the world!
Found a bug? Open an issue on GitHub and we’ll fix it!
Email the Chonkie team at support@chonkie.ai for any questions or feedback!