
URL Importer

Imports data from a URL.

Default

Usage

cdp imp url <url> [flags]

Get Help

Get help for the command with the following flag:

cdp imp url --help

URLs with binary data

Currently, the URL importer does not support URLs that serve binary data, such as PDFs. Need this? Raise an issue here.
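One way to avoid pointing the importer at a binary resource is to inspect the URL's `Content-Type` header first. The following is a minimal sketch, not part of `cdp`: the helper names `is_text_content_type` and `fetch_content_type` are hypothetical, the list of accepted MIME types is an assumption, and `curl` is assumed to be installed. It only checks the starting URL, so pages reached at greater depth may still link to binary files.

```shell
#!/usr/bin/env bash

# Hypothetical helper (not part of cdp): succeeds when a Content-Type value
# looks like text. The accepted MIME types below are an assumption.
is_text_content_type() {
  local ctype
  # Lowercase the value, drop parameters such as "; charset=utf-8",
  # and strip any remaining whitespace.
  ctype=$(printf '%s' "$1" | tr '[:upper:]' '[:lower:]' | cut -d';' -f1 | tr -d '[:space:]')
  case "$ctype" in
    text/*|application/json|application/xml|application/xhtml+xml) return 0 ;;
    *) return 1 ;;
  esac
}

# Hypothetical helper: request only the headers, follow redirects, and print
# the Content-Type of the final response (empty if the header is absent).
fetch_content_type() {
  curl -sIL "$1" | tr -d '\r' |
    awk -F': *' 'tolower($1) == "content-type" { v = $2 } END { print v }'
}

# Example usage (requires network access and cdp on the PATH):
#   url="https://docs.trychroma.com/embeddings"
#   if is_text_content_type "$(fetch_content_type "$url")"; then
#     cdp imp url "$url" -d 2
#   else
#     echo "Skipping $url: binary or unknown content type" >&2
#   fi
```

Some servers do not answer `HEAD` requests or report a generic content type, so an empty or unexpected value is treated as "not text" here rather than guessed at.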

The following example imports data from the ChromaDB documentation with a maximum depth of 2.

cdp imp url https://docs.trychroma.com/embeddings -d 2

Advanced Usage

The following example imports data from the ChromaDB documentation with a maximum depth of 3, splits it into 512-byte chunks, removes emojis, and embeds the result using the default embedding function.

cdp imp url https://docs.trychroma.com/ -d 3 | cdp chunk -s 512 | cdp tx emoji-clean -m | cdp embed --ef default