Skip to content

AstraDB

Configure your own AstraDB instance as data sink.

astradb

```python from llama_cloud.types import CloudAstraDBVectorStore

ds = { ‘name’: '', ‘sink_type’: ‘ASTRA_DB’, ‘component’: CloudAstraDBVectorStore( token='', api_endpoint='', collection_name='', embedding_dimension=1536, # Length of embedding vectors keyspace='', # optional (default: ‘default_keyspace’) ) } data_sink = client.data_sinks.create_data_sink(request=ds)

</TabItem>
<TabItem value="typescript" label="TypeScript Client" default>
```Typescript
const ds = {
'name': 'astradb',
'sinkType': 'ASTRA_DB',
'component': {
'token': '<astra-db-application-token>',
'api_endpoint': '<astra-db-api-endpoint>',
'collection_name': '<collection-name>',
'embedding_dimension': 1536, // Length of embedding vectors
'keyspace': '<keyspace-name>', // optional (default: 'default_keyspace')
}
}
data_sink = await client.dataSinks.createDataSink({
projectId: projectId,
body: ds
})
ParameterTypeRequiredDescription
tokenstringYesThe Astra DB Application Token to use for authentication
api_endpointstringYesThe Astra DB JSON API endpoint for your database
collection_namestringYesCollection name to use. If not existing, it will be created
embedding_dimensionintegerYesLength of the embedding vectors in use (e.g., 1536 for OpenAI)
keyspacestringNoThe keyspace to use. If not provided, ‘default_keyspace’ will be used

Before configuring AstraDB as a data sink, ensure you have:

  1. AstraDB Database: A running AstraDB database instance
  2. Application Token: An AstraDB Application Token with appropriate permissions
  3. API Endpoint: The JSON API endpoint URL for your database
  4. Keyspace: A keyspace in your database (optional, will use ‘default_keyspace’ if not specified)
  • Visit the AstraDB Console
  • Create a new database or use an existing one
  • Note down your database’s API endpoint
  • In the AstraDB Console, navigate to your database
  • Go to the “Connect” tab
  • Generate an Application Token with the necessary permissions
  • Save the token securely

Use the token and API endpoint in your data sink configuration as shown in the examples above.

When using AstraDB as a data sink, you can apply filters using standard MongoDB-style query operators:

Filter OperatorDescription
$eqEquals
$neNot equal
$gtGreater than
$ltLess than
$gteGreater than or equal
$lteLess than or equal
$inValue is in a list
$ninValue is not in a list

These filters can be applied to metadata fields when querying your AstraDB collection to refine search results based on specific criteria.