In this quick start guide we show how to setup open source kafka and Zilliz Cloud to ingest vector data.
Complete the following steps to download the kafka-connect-milvus plugin.
- download the latest plugin zip file
zilliz-kafka-connect-milvus-xxx.zip
from here.
- Download the latest kafka from here.
- Unzip the downloaded file and go to the kafka directory.
$ tar -xzf kafka_2.13-3.6.1.tgz
$ cd kafka_2.13-3.6.1
NOTE: Your local environment must have Java 8+ installed.
Run the following commands in order to start all services in the correct order:
- Start the ZooKeeper service
$ bin/zookeeper-server-start.sh config/zookeeper.properties
Open another terminal session and run:
- Start the Kafka broker service
$ bin/kafka-server-start.sh config/server.properties
Once all services have successfully launched, you will have a basic Kafka environment running and ready to use.
- check the official quick start guide form kafka for details: https://kafka.apache.org/quickstart
Ensure you have Kafka and Zilliz Cloud setup and properly configured.
- If you don't already have a topic in Kafka, create a topic (e.g.
topic_0
) in Kafka.
$ bin/kafka-topics.sh --create --topic topic_0 --bootstrap-server localhost:9092
- If you don't already have a collection in Zilliz Cloud, create a collection with a vector field (in this example the vector has
dimension=8
). You can use the following example schema on Zilliz Cloud:
Note: Make sure the schema on both sides match each other. In the schema, there is exactly one vector field. The names of each field on both sides are exactly the same.
- unzip the
zilliz-kafka-connect-milvus-xxx.zip
file you downloaded in Step 1. - copy the
zilliz-kafka-connect-milvus
directories to thelibs
directory of your Kafka installation. - modify the
connect-standalone.properties
file in theconfig
directory of your Kafka installation.key.converter.schemas.enable=false value.converter.schemas.enable=false plugin.path=libs/zilliz-kafka-connect-milvus-xxx
- create and configure a
milvus-sink-connector.properties
file in theconfig
directory of your Kafka installation.name=zilliz-kafka-connect-milvus connector.class=com.milvus.io.kafka.MilvusSinkConnector public.endpoint=https://<public.endpoint>:port token=***************************************** collection.name=topic_0 topics=topic_0
- Start the connector with the previous configuration file
$ bin/connect-standalone.sh config/connect-standalone.properties config/milvus-sink-connector.properties
- Try produce a message to the Kafka topic you just created in Kafka
bin/kafka-console-producer.sh --topic topic_0 --bootstrap-server localhost:9092
>{"id": 0, "title": "The Reported Mortality Rate of Coronavirus Is Not Important", "title_vector": [0.041732933, 0.013779674, -0.027564144, -0.013061441, 0.009748648, 0.00082446384, -0.00071647146, 0.048612226], "link": "https://medium.com/swlh/the-reported-mortality-rate-of-coronavirus-is-not-important-369989c8d912"}
- Check if the entity has been inserted into the collection in Zilliz Cloud. Here is what it looks like on Zilliz Cloud if the insertion succeeds:
If you require any assistance or have questions regarding the Kafka Connect Milvus Connector, please feel free to reach out to our support team: Email: [email protected]