LLM App Stack

February 15, 2024

aka Emerging Architectures for LLM Applications

[Image: Emerging LLM App Stack]

This is a list of available tools, projects, and vendors at each layer of the LLM app stack.

Our original article included only the most popular options, based on user interviews. This repo is meant to be more comprehensive, covering all available options in each category. We probably still missed some important projects, so please open a PR if you see anything missing.

We also included Perplexity and Cursor.sh prompts to make searching and markdown table formatting easier.

Table of Contents

  1. Data Pipelines
  2. Embedding Models
  3. Vector Databases
  4. Playgrounds
  5. Orchestrators
  6. APIs / Plugins
  7. LLM Caches
  8. Logging / Monitoring / Eval
  9. Validators
  10. LLM APIs (proprietary)
  11. LLM APIs (open source)
  12. App Hosting Platforms
  13. Cloud Providers
  14. Opinionated Clouds

Project List

Data Pipelines

| Name (site) | Description | GitHub | Pip Installs |
| --- | --- | --- | --- |
| Databricks | A unified data platform for building, deploying, and maintaining enterprise data solutions, including products (like MosaicML and MLflow) purpose-built for AI | https://github.com/apache/spark | https://pypi.org/project/pyspark |
| Airflow | A data pipeline framework to programmatically author, schedule, and monitor data pipelines and workflows, including for LLMs | https://github.com/apache/airflow | https://pypi.org/project/apache-airflow |
| Unstructured.io | Open-source components for pre-processing documents such as PDFs, HTML, and Word documents for usage with LLM apps | https://github.com/Unstructured-IO/unstructured | https://pypi.org/project/unstructured |
| Fivetran | A platform that extracts, loads, and transforms data from various sources for analytics, AI, and operations | N/A | https://pypi.org/project/fivetran |
| Airbyte | An open-source data integration engine that helps consolidate data in data warehouses, lakes, and databases | https://github.com/airbytehq/airbyte | https://pypi.org/project/airbyte-cdk |
| Anyscale | An AI compute platform that allows developers to scale data ingest, preprocessing, embedding, and inference computations using Ray | https://github.com/ray-project/ray | https://pypi.org/project/ray |
| Alluxio | An open-source data platform at the intersection of compute and storage, bringing data closer to compute, to accelerate model training and serving, boost GPU utilization, and reduce costs for AI workloads | https://github.com/Alluxio/alluxio | https://pypi.org/project/alluxio-python-library/ |

^ Back to Contents ^

Embedding Models

| Name (site) | Description | GitHub | Pip Installs |
| --- | --- | --- | --- |
| OpenAI Ada Embedding 2 | OpenAI's most popular embedding model for capturing semantic relationships in text | n/a | https://pypi.org/project/openai |
| Cohere AI | An independent commercial provider of LLMs, with particular focus on embeddings for semantic search, topic clustering, and vertical applications | https://github.com/cohere-ai/notebooks | https://pypi.org/project/cohere |
| Sentence Transformers | An open-source Python framework for sentence, text, and image embeddings | https://github.com/UKPLab/sentence-transformers | https://pypi.org/project/sentence-transformers |

^ Back to Contents ^

Vector Databases

| Name (site) | Description | GitHub | Pip Installs |
| --- | --- | --- | --- |
| Pinecone | A managed, cloud-native vector database with a simple API for high-performance AI applications | n/a | https://pypi.org/project/pinecone-client |
| Weaviate | An open-source vector database that stores both objects and vectors | https://github.com/weaviate/weaviate | https://pypi.org/project/weaviate-client |
| ChromaDB | An AI-native, open-source embedding database platform for developers | https://github.com/chroma-core/chroma | https://pypi.org/project/chromadb |
| Pgvector | An open-source vector similarity search for Postgres, allowing for exact and approximate nearest-neighbor search | https://github.com/pgvector/pgvector | https://pypi.org/project/pgvector |
| Zilliz (Milvus) | An open-source vector database, built for developing and maintaining AI applications | https://github.com/milvus-io/milvus | https://pypi.org/project/pymilvus |
| Qdrant | A vector database and vector similarity search engine | https://github.com/qdrant/qdrant | https://pypi.org/project/qdrant-client |
| Metal io | A managed service for developers to build applications with ML embeddings | N/A | https://pypi.org/project/metal-python |
| LanceDB | A serverless vector database for AI applications | https://github.com/lancedb/lancedb | https://pypi.org/project/lancedb |

^ Back to Contents ^

Playgrounds

| Name (site) | Description | GitHub | Pip Installs |
| --- | --- | --- | --- |
| OpenAI Playground | A web-based platform for experimenting with various machine-learning models developed by OpenAI | N/A | N/A |
| nat.dev | A platform that allows users to test prompts with multiple language models and compare their performance | https://github.com/nat/openplayground | https://pypi.org/project/openplayground |
| Humanloop | A platform that helps developers build applications on top of LLMs | https://github.com/humanloop/humanloop-tutorial-python | https://pypi.org/project/humanloop |
| Parea AI | Platform and SDK for AI Engineers providing tools for LLM evaluation, observability, and a version-controlled enhanced prompt playground. | https://github.com/parea-ai | https://pypi.org/project/parea-ai/ |

^ Back to Contents ^

Orchestrators

| Name (site) | Description | GitHub | Pip Installs |
| --- | --- | --- | --- |
| Langchain | An open-source library that gives developers the tools to build applications powered by LLMs | https://github.com/langchain-ai/langchain | https://pypi.org/project/langchain |
| LlamaIndex | A data framework for LLM applications to ingest, structure, and access private or domain-specific data | https://github.com/jerryjliu/llama_index | https://pypi.org/project/llama-index |
| Autogen | A framework for automating and streamlining LLM workflows using customizable, conversable agents for complex AI applications | https://github.com/microsoft/autogen | https://pypi.org/project/pyautogen |
| Microsoft Semantic Kernel | A lightweight open-source orchestration SDK | https://github.com/microsoft/semantic-kernel | https://pypi.org/project/semantic-kernel |
| Haystack | LLM orchestration framework to build customizable, production-ready LLM applications | https://github.com/deepset-ai/haystack | https://pypi.org/project/farm-haystack/ |
| Vercel AI SDK | An open-source library for developers to build streaming UIs in JavaScript and TypeScript | https://github.com/vercel/ai | https://pypi.org/project/vercel-ai-sdk (node/npm) |
| Vectara AI | A search and discovery platform for AI conversations utilizing your own data | https://github.com/vectara/vectara-ingest | N/A |
| ChatGPT | An AI chatbot that uses natural language processing to create humanlike conversational dialogue | N/A | N/A |

^ Back to Contents ^

APIs / Plugins

| Name (site) | Description | GitHub | Pip Installs |
| --- | --- | --- | --- |
| Serp API | A real-time API to access Google search results, as well as handling proxies, solving captchas, and parsing structured data | https://github.com/serpapi/google-search-results-python | https://pypi.org/project/google-search-results |
| Wolfram Alpha API | A web-based API providing computational and presentation capabilities for integration into various applications | N/A | https://pypi.org/project/wolframalpha |
| Zapier API AI Plugin | A plugin that allows you to connect 5,000+ apps and interact with them directly inside ChatGPT | N/A | N/A |

^ Back to Contents ^

LLM Caches

| Name (site) | Description | GitHub | Pip Installs |
| --- | --- | --- | --- |
| Redis | An in-memory data structure store used as a database, cache, message broker, and streaming engine | https://github.com/redis/redis | https://pypi.org/project/redis/ |
| SQLite | A self-contained, serverless, zero-configuration, transactional SQL database engine | https://github.com/sqlite/sqlite | https://pypi.org/project/pysqlite3/ |
| GPTCache | An open-source tool for improving the efficiency and speed of GPT-based applications by implementing a cache to store the responses | https://github.com/zilliztech/GPTCache | N/A |
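
To make the idea of this layer concrete, here is a minimal sketch of an exact-match response cache built on Python's standard `sqlite3` module. It is only an illustration of the general pattern, not how any project listed above is implemented; the names `make_cache`, `cached_completion`, the `llm_cache` table, and the `call_llm` callback are all hypothetical.

```python
import hashlib
import sqlite3


def make_cache(path=":memory:"):
    """Open (or create) a SQLite database that stores prompt/response pairs."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS llm_cache (key TEXT PRIMARY KEY, response TEXT)"
    )
    return conn


def cached_completion(conn, model, prompt, call_llm):
    """Return the cached response for (model, prompt); call call_llm on a miss."""
    # Hash the model name and prompt together so each pair gets its own key.
    key = hashlib.sha256(f"{model}\x00{prompt}".encode("utf-8")).hexdigest()
    row = conn.execute(
        "SELECT response FROM llm_cache WHERE key = ?", (key,)
    ).fetchone()
    if row is not None:
        return row[0]  # cache hit: the model is not called

    # Cache miss: call the (hypothetical) model function and store the result.
    response = call_llm(model, prompt)
    conn.execute(
        "INSERT OR REPLACE INTO llm_cache (key, response) VALUES (?, ?)",
        (key, response),
    )
    conn.commit()
    return response
```

Here `call_llm` stands in for whatever client function your app uses to reach a model. An exact-match key only helps when the same prompt repeats verbatim, so treat this as a starting point rather than a full caching strategy.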

^ Back to Contents ^

Logging / Monitoring / Eval

| Name (site) | Description | GitHub | Pip Installs |
| --- | --- | --- | --- |
| Braintrust Data | An AI product stack featuring evaluations, prompt playgrounds, continuous integration, dataset management, and access to various AI models through a single API | https://github.com/braintrustdata/braintrust-proxy | https://pypi.org/project/braintrust |
| Arize AI | An observability platform for both LLMs and supervised ML | https://github.com/Arize-ai | https://pypi.org/project/arize |
| Weights & Biases | An MLOps platform for streamlining ML workflows | https://github.com/wandb/wandb | https://pypi.org/project/wandb |
| MLflow | A platform to streamline ML development | https://github.com/mlflow/mlflow | https://pypi.org/project/mlflow |
| PromptLayer | A platform for tracking, managing, and sharing LLM prompt engineering | https://github.com/MagnivOrg/prompt-layer-library | https://pypi.org/project/promptlayer |
| Helicone | An open-source observability platform for LLMs | https://github.com/Helicone/helicone | https://pypi.org/project/helicone |
| Quotient AI | Quotient AI is a platform for evaluating AI products on real-world use-cases, during research, development, and in production | N/A | N/A |
| Portkey AI | A platform to develop, launch, maintain, and iterate generative AI apps and features | N/A | N/A |
| Freeplay AI | A platform to prototype, test, and optimize LLM features for customers | N/A | N/A |
| Gentrace | An API and SDKs for evaluating and observing generative data, with features like AI, heuristic, and human grading evaluations, as well as production data observation | N/A | https://pypi.org/project/gentrace-py |
| Patronus AI | An automated evaluation and benchmarking platform for LLMs, providing tools for testing, scoring, and evaluating LLMs in real-world scenarios | N/A | N/A |
| Autoblocks AI | A collaborative cloud-based workspace designed for rapid iteration on GenAI products, offering features like prompt management, observability, continuous evaluations, fine-tuning, prototyping, debugging, and scalable data ingestion & search, all in a provider-agnostic environment | N/A | https://pypi.org/project/autoblocksai |
| Context AI | Tools for pre-launch LLM evaluations and post-launch analytics, with features such as testing, performance monitoring, user conversation analysis, and support for various models and libraries | N/A | https://pypi.org/project/context-python |
| E2b dev | Services to deploy, test, and monitor AI agents, including a sandbox with a secure, long-running cloud environment for various LLMs with features like internet access | https://github.com/e2b-dev/e2b | https://pypi.org/project/e2b |
| Agentops | Toolkit for evaluating and developing AI agents, providing tools for agent development, monitoring capabilities, and replay analytics | https://github.com/AgentOps-AI/agentops | https://pypi.org/project/agentops |
| Zenoml | AI evaluation platform that enables data visualization, model performance analysis, and the creation of interactive reports for various data types | https://github.com/zeno-ml/zeno-build | https://pypi.org/project/zeno-client |
| Baserun | Tools for model configuration, prompt playground, monitoring, and prototype workflow, as well as features for full visibility into LLM workflows and end-to-end testing | https://github.com/baserun-ai/baserun-py | https://pypi.org/project/baserun |
| WhyLabs | AI Observability platform for ML and GenAI including LLM monitoring, guardrails and security | https://github.com/whylabs | https://pypi.org/project/whylabs-client |
| Log10 | AI-powered LLMOps platform that automatically optimizes prompts and models with built-in logging, debugging, metrics, feedback, evaluations and fine-tuning | https://github.com/log10-io/log10 | https://pypi.org/project/log10-io, https://pypi.org/project/llmeval |
| promptfoo | Open-source LLM eval framework with support for model/prompt/RAG eval, dataset generation, local models, and self-hosting. | https://github.com/promptfoo/promptfoo | https://www.npmjs.com/package/promptfoo (node/npm) |
| Parea AI | Platform and SDK for AI Engineers providing tools for LLM evaluation, observability, and a version-controlled enhanced prompt playground. | https://github.com/parea-ai | https://pypi.org/project/parea-ai/ |
| Galileo | Galileo is a platform for evaluation, fine-tuning and real-time observability, powered by high-accuracy hallucination guardrails. | N/A | N/A |

^ Back to Contents ^

Validators

| Name (site) | Description | GitHub | Pip Installs |
| --- | --- | --- | --- |
| Guardrails AI | An open-source Python package for specifying structure and type, validating, and correcting the outputs of LLMs | https://github.com/ShreyaR/guardrails | https://pypi.org/project/guardrails-ai |
| Rebuff | An open-source framework designed to detect and protect against prompt injection attacks in LLM apps | https://github.com/woop/rebuff | https://pypi.org/project/rebuff |
| Microsoft Guidance | A guidance language for controlling LLMs, providing a syntax for architecting LLM workflows | https://github.com/microsoft/guidance | https://pypi.org/project/guidance |
| LMQL | An open-source programming language and platform for language model interaction | https://github.com/eth-sri/lmql | https://pypi.org/project/lmql |
| Outlines | A tool for helping developers guide text generation to build robust interfaces with external systems and guarantee that outputs match a regex or JSON schema | https://github.com/outlines-dev/outlines | https://pypi.org/project/outlines |
| LLM Guard | An open-source, comprehensive tool designed to fortify the security of Large Language Models (LLMs). | https://github.com/laiyer-ai/llm-guard | https://pypi.org/project/llm-guard |

^ Back to Contents ^

LLM APIs (proprietary)

| Name (site) | Description | GitHub | Pip Installs |
| --- | --- | --- | --- |
| OpenAI | A company providing many leading LLMs, including the GPT-3.5 and GPT-4 families | N/A | https://pypi.org/project/openai |
| Anthropic | The developer of Claude, an AI assistant based on Anthropic’s research | N/A | https://pypi.org/project/anthropic |
| Cohere AI | An LLM vendor with particular focus on embeddings for semantic search, topic clustering, and vertical applications | https://github.com/cohere-ai/notebooks | https://pypi.org/project/cohere |
| LLM | A CLI utility and Python library for interacting with Large Language Models, both via remote APIs and models that can be installed and run on your own machine. | https://github.com/simonw/llm | https://pypi.org/project/llm/ |

^ Back to Contents ^

LLM APIs (open source)

| Name (site) | Description | GitHub | Pip Installs |
| --- | --- | --- | --- |
| Hugging Face | A hub for open-source AI models and inference endpoints, including leading base LLMs and LoRAs/fine-tunes | https://github.com/huggingface/transformers | https://pypi.org/project/transformers |
| Replicate | An AI hosting platform and model inference hub that allows software developers to integrate AI models into their apps | https://github.com/replicate/cog | https://pypi.org/project/replicate |
| Anyscale | An AI API and compute platform that allows developers to scale inference, training, and embedding computations with any model using Ray | https://github.com/ray-project/ray | https://pypi.org/project/ray |
| Ollama | Get up and running with large language models locally | https://github.com/ollama/ollama | https://pypi.org/project/ollama/ |
| GPT4ALL | An ecosystem of open-source on-edge large language models. | https://github.com/nomic-ai/gpt4all | https://pypi.org/project/gpt4all/ |

^ Back to Contents ^

App Hosting Platforms

| Name (site) | Description |
| --- | --- |
| Vercel | A cloud platform designed for front-end engineers, built with first-class support for LLM apps |
| Netlify | An enterprise cloud computing company that offers a development platform for web applications and dynamic websites |
| Steamship | An SDK and hosting platform for AI agents and tools, both a package manager and package hosting service for AI |
| Streamlit | An open-source Python library designed for creating and sharing custom web apps for ML and data science |
| Modal | A platform that enables running distributed applications using the modal Python package |

^ Back to Contents ^

Cloud Providers

| Name (site) | Description |
| --- | --- |
| Amazon Web Services | A cloud computing platform, offering services from data centers globally |
| Google Cloud Platform | A cloud computing platform, offering services from data centers globally |
| Microsoft Azure | A cloud computing platform, offering services from data centers globally |
| CoreWeave | A specialized cloud provider that delivers GPUs on top of flexible deployment infrastructure |

^ Back to Contents ^

Opinionated Clouds

| Name (site) | Description |
| --- | --- |
| Databricks (MosaicML) | Databricks acquired MosaicML in 2023, along with its tooling and platform for efficient pre-training, fine-tuning, and inferencing LLMs |
| Anyscale | An AI compute platform that enables developers to scale inference, training, and embedding computations with any model using Ray |
| Modal | A platform that enables running distributed applications using the Modal Python package |
| Runpod | A cloud computing platform designed for AI and ML applications |
| OctoML | A compute service that allows users to run, tune, and scale generative models |
| Baseten | An inference service that allows users to deploy, serve, and scale custom and open-source models |
| E2B | Secure sandboxed cloud environments made for AI agents and AI apps |

^ Back to Contents ^




Formatting Prompt Templates

We were able to partially automate this, particularly finding GitHub and PyPI links, using this Perplexity search prompt. It worked roughly 75% of the time and could handle about 3 projects at a time, pulling data from 20-30 sources in each iteration.

[Image: Perplexity search prompt]

Once you have the data you would like to add, if you don't want to deal with the markdown formatting here, it is easy to format correctly using a tool like Cursor.

See the prompt below, which works as an inline edit. Just make sure you highlight 4-5 previous examples so Cursor can infer the format itself:

[Image: Cursor inline-edit prompt]
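
If you would rather script the formatting than prompt for it, a small helper can produce a row in the same four-column layout (Name, Description, GitHub, Pip Installs). This is a minimal sketch and not part of this repo's tooling; the function name `format_row` and its parameters are hypothetical, and it writes links as plain URLs.

```python
def format_row(name, description, github_url=None, pypi_url=None):
    """Build one markdown table row: Name | Description | GitHub | Pip Installs."""

    def cell(value):
        # Missing values become N/A; pipes are escaped so they cannot
        # be mistaken for column separators.
        if not value:
            return "N/A"
        return value.strip().replace("|", "\\|")

    cells = [cell(v) for v in (name, description, github_url, pypi_url)]
    return "| " + " | ".join(cells) + " |"


if __name__ == "__main__":
    print(
        format_row(
            "Qdrant",
            "A vector database and vector similarity search engine",
            "https://github.com/qdrant/qdrant",
            "https://pypi.org/project/qdrant-client",
        )
    )
```

Leaving out `github_url` or `pypi_url` yields `N/A` in that column, matching how entries without a public repo or package are listed above.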

^ Back to Contents ^