LLM
Catalog
MCP Servers
- mcphost - A CLI host application that enables Large Language Models (LLMs) to interact with external tools through the Model Context Protocol (MCP).
Vendors
Google
OpenAI
Tools
- TokenCost - Easy token price estimates for 400+ LLMs. TokenOps
- agents.md - A simple, open format for guiding coding agents, used by over 20k open-source projects.
- aibrix - Cost-efficient and pluggable Infrastructure components for GenAI inference
- bifrost - Fastest LLM gateway (50x faster than LiteLLM) with adaptive load balancer, cluster mode, guardrails, 1000+ models support & <100 µs overhead at 5k RPS.
- mistral.rs - Blazingly fast LLM inference
- ollama - Get up and running with Llama 3, Mistral, Gemma, and other large language models
Status Pages
Models
| Creator | Name | Hugging Face | Ollama |
|---|---|---|---|
| Alibaba | Qwen 3 VL | Ollama | |
| BAAI | bge-m3 | HF | Ollama |
| EuroLLM | EuroLLM | HF | |
| embeddinggemma | Ollama | ||
| gemma3 | Ollama | ||
| gemma3n | Ollama | ||
| vaultgemma | HF | ||
| Lumi | Viking | HF | |
| Mistral AI | mistral | Ollama | |
| SCB 10X | typhoon-ocr-3b | Ollama | |
| SCB 10X | typhoon-translate-4b | Ollama | |
| TildeAI | Tilde | HF |
Demo
Apps
- gallery - A gallery that showcases on-device ML/GenAI use cases and allows people to try and use models locally.
Resources
- Artificial Analysis - AI Model & API Providers Analysis
- LLM Pricing
- LLM Explorer
- Killed by LLM
- The Ultra-Scale Playbook: Training LLMs on GPU Clusters