A specialized storage system designed to efficiently handle and query high-dimensional vector data, enabling similarity search and AI applications
-
Qdrant - Open-source vector database written in Rust offering high-performance similarity search with cloud-native scalability
-
Pinecone - Serverless vector database platform optimized for machine learning applications with enterprise-grade security
-
Weaviate - Fast, flexible AI-native vector database with built-in model serving and multi-tenant capabilities
-
PGVector - PostgreSQL extension that enables vector similarity search with ACID compliance and SQL integration
-
Supabase - Postgres-based platform that includes vector search capabilities alongside other database features
Frameworks and platforms that enable efficient deployment and serving of large language models, optimizing for performance and scalability in production environments
-
vLLM - High-performance inference engine using PagedAttention for optimal serving throughput
-
Ollama - Framework for running and serving large language models locally with cross-platform support
-
LM Studio - Local LLM development environment with built-in model management and serving capabilities
-
Groq - Cloud platform offering ultra-fast LLM inference with specialized hardware acceleration
-
MLX - Apple's machine learning framework optimized for Apple Silicon
-
KServe - Kubernetes-based model serving platform supporting multiple frameworks and auto-scaling
Software infrastructures that support the creation and management of autonomous agents, providing tools for development, deployment, and interaction between AI agents
-
LangChain - Comprehensive framework for building and connecting LLM-powered applications
-
LangGraph - Framework specialized in creating stateful, multi-agent workflows
-
LlamaIndex - Framework for building RAG applications with data connection capabilities
-
AutoGen/Magnetic-One - Microsoft's framework enabling sophisticated multi-agent collaboration patterns
-
CrewAI - Platform for creating role-based AI agent teams with specialized tasks and collaboration
Software components that enable continuous integration, delivery, and automation of machine learning workflows, including model training, testing, and deployment
-
Apache Kafka - Distributed streaming platform for building real-time data pipelines
-
Google Cloud MLOps - Comprehensive suite of tools for ML model deployment and management
-
Databricks MLflow - End-to-end platform for managing the ML lifecycle with experiment tracking
-
Kubeflow - Kubernetes-native platform for deploying ML workflows
-
Delta Lake - Storage layer that brings ACID transactions to data lakes
Platforms and solutions that provide visibility into AI system performance, helping detect issues, analyze behavior, and ensure reliability of AI deployments
-
Arize Phoenix - Open-source platform specifically designed for LLM observability and evaluation
-
Evidently - Tool for ML model monitoring and evaluation in production
-
Seldon - Enterprise platform for deploying and monitoring ML models at scale
- Qdrant: https://qdrant.tech/
- Pinecone: https://www.pinecone.io/
- Weaviate: https://weaviate.io/
- PGVector: https://github.com/pgvector/pgvector
- Supabase: https://supabase.com/
- LLM Serving Frameworks/Tools
- vLLM: https://github.com/vllm-project/vllm
- Ollama: https://ollama.ai/
- LM Studio: https://lmstudio.ai/
- Groq: https://groq.com/
- MLX (Apple ML Ecosystem): https://developer.apple.com/machine-learning/
- KServe: https://kserve.github.io/
- Agent Development Frameworks
- LangChain: https://www.langchain.com/
- LlamaIndex: https://llamaindex.ai/
- AutoGen (Magnetic-One): https://github.com/microsoft/autogen
- CrewAI: https://crew.ai/
- MLOps Pipeline Tools
- Apache Kafka: https://kafka.apache.org/
- Google Cloud MLOps (Vertex AI): https://cloud.google.com/vertex-ai
- Databricks MLflow: https://mlflow.org/
- Kubeflow: https://www.kubeflow.org/
- Delta Lake: https://delta.io/
- Monitoring and Observability
- Arize Phoenix: https://github.com/Arize-ai/phoenix or https://www.arize.com/phoenix
- Evidently: https://evidentlyai.com/
- Seldon: https://www.seldon.io/