Stars
Apache Pulsar - distributed pub-sub messaging system
AstraPy is a Pythonic interface for DataStax Astra DB and the Data API
Lightweight, fast function pipeline (DAG) creation in pure Python for scientific (HPC) workflows 🕸️🧪
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
The official Python client for the Hugging Face Hub.
Open-Source Web UI for managing Apache Kafka clusters
Developer-friendly OSS embedded retrieval library for multimodal AI. Search More; Manage Less.
DuckDB is an analytical in-process SQL database management system
Python Interface for the Popular mermaid-js Library, Simplified for Diagram Creation
A lightweight and powerful JSONPath implementation for Python.
🌐 The easiest way to parse and modify URLs in Python.
data load tool (dlt) is an open-source Python library that makes data loading easy 🛠️
This is a repo with links to everything you'd ever want to learn about data engineering
A collection of creative AnyWidgets for Python notebook environments
k8s operator and plugin for marimo deployment