Chonkie Documentation

POST

chunk

late

curl --request POST \
  --url https://api.chonkie.ai/v1/chunk/late \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: multipart/form-data' \
  --form embedding_model=all-MiniLM-L6-v2 \
  --form chunk_size=512 \
  --form recipe=default \
  --form lang=en \
  --form min_characters_per_chunk=24

[
  {
    "text": "<string>",
    "start_index": 123,
    "end_index": 123,
    "token_count": 123,
    "embedding": [
      123
    ]
  }
]

Authorizations

Authorization

string

header

required

Your API Key from the Chonkie Cloud dashboard

Body

multipart/form-data

Response

200 - application/json

Successful Response: A list of LateChunk objects.

A list containing LateChunk objects, detailing segments, sentences, and an optional chunk-level embedding derived from the full document.

SDPM Chunker

Code Chunker

curl --request POST \
  --url https://api.chonkie.ai/v1/chunk/late \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: multipart/form-data' \
  --form embedding_model=all-MiniLM-L6-v2 \
  --form chunk_size=512 \
  --form recipe=default \
  --form lang=en \
  --form min_characters_per_chunk=24

[
  {
    "text": "<string>",
    "start_index": 123,
    "end_index": 123,
    "token_count": 123,
    "embedding": [
      123
    ]
  }
]

API Reference

Late Chunker

Authorizations

Body

Response