ElixirLLM


One beautiful API for all LLMs. Chat with GPT-4, Claude, Llama, and more—using idiomatic Elixir.

No more juggling different APIs. No more provider lock-in. Just clean, pipe-friendly code that works everywhere.

{:ok, response} = ElixirLLM.chat("What is Elixir?")

Inspired by the wonderful RubyLLM.


Demo

{:ok, response, _chat} =
  ElixirLLM.new()
  |> ElixirLLM.model("gpt-4o")
  |> ElixirLLM.tool(MyApp.Tools.Weather)
  |> ElixirLLM.ask("What's the weather in Tokyo?")

# Tool called: get_weather(%{city: "Tokyo"})
# Tool result: %{temperature: 18, conditions: "partly cloudy"}

response.content
#=> "It's currently 18°C and partly cloudy in Tokyo."

Why ElixirLLM?

Every AI provider has its own API. Different formats. Different conventions. Different headaches.

ElixirLLM gives you one consistent interface for all of them:

| Feature | What it means |
| --- | --- |
| Unified API | Same code works with GPT-4, Claude, Llama, and more |
| Pipe-friendly | Idiomatic Elixir with chainable configuration |
| Tools that just work | Define once, automatic execution loop handles the rest |
| Streaming built-in | Real-time responses with callbacks or Streams |
| Phoenix/Ecto ready | First-class persistence with `mix elixir_llm.gen.ecto` |
| Minimal deps | Just Req, Jason, and Telemetry, no bloat |
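
Because the interface is uniform, switching providers is a one-line change. A quick sketch (the prompt is illustrative; the model names come from the Providers table below):

ask = fn model ->
  ElixirLLM.new()
  |> ElixirLLM.model(model)
  |> ElixirLLM.ask("Summarize OTP in one sentence.")
end

{:ok, _response, _chat} = ask.("gpt-4o")                   # OpenAI
{:ok, _response, _chat} = ask.("claude-sonnet-4-20250514") # Anthropic
{:ok, _response, _chat} = ask.("llama3.2")                 # Ollama, local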

Installation

Add to your mix.exs:

def deps do
  [
    {:elixir_llm, "~> 0.2.0"}
  ]
end

Then configure your API keys:

# config/runtime.exs
config :elixir_llm,
  openai: [api_key: System.get_env("OPENAI_API_KEY")],
  anthropic: [api_key: System.get_env("ANTHROPIC_API_KEY")]

Quick Start

Simple Chat

# One-liner
{:ok, response} = ElixirLLM.chat("Explain pattern matching in Elixir")

# With options
{:ok, response} = ElixirLLM.chat("Hello!", model: "claude-sonnet-4-20250514")

Pipe-Based Configuration

{:ok, response, chat} =
  ElixirLLM.new()
  |> ElixirLLM.model("gpt-4o")
  |> ElixirLLM.temperature(0.7)
  |> ElixirLLM.instructions("You are a helpful Elixir expert")
  |> ElixirLLM.ask("What's the difference between Task and GenServer?")

# Continue the conversation
{:ok, response, chat} = ElixirLLM.ask(chat, "Show me an example")

Streaming

# With callback - tokens appear in real-time
{:ok, response, _chat} =
  ElixirLLM.new()
  |> ElixirLLM.model("gpt-4o")
  |> ElixirLLM.ask("Write a haiku about Elixir", stream: fn chunk ->
    IO.write(chunk.content || "")
  end)

# As Elixir Stream - composable and lazy
ElixirLLM.new()
|> ElixirLLM.stream("Tell me a story")
|> Stream.each(&IO.write(&1.content || ""))
|> Stream.run()

Tools

Let AI call your Elixir code. ElixirLLM handles the entire tool execution loop automatically—no manual back-and-forth required.

Define a Tool

defmodule MyApp.Tools.Weather do
  use ElixirLLM.Tool,
    name: "get_weather",
    description: "Gets current weather for a location"

  param :city, :string, required: true, description: "City name"
  param :units, :string, required: false, description: "celsius or fahrenheit"

  @impl true
  def execute(%{city: city} = params) do
    # :units is optional, so read it with a default rather than pattern matching on it
    _units = Map.get(params, :units, "celsius")
    # Your implementation here
    {:ok, %{temperature: 22, conditions: "sunny", city: city}}
  end
end

Use Tools

{:ok, response, _chat} =
  ElixirLLM.new()
  |> ElixirLLM.tool(MyApp.Tools.Weather)
  |> ElixirLLM.ask("What's the weather in Berlin?")

# The model calls your tool, gets the result, and responds naturally:
# => "The current weather in Berlin is sunny with a temperature of 22°C."

Inline Tools

For quick one-offs:

calculator = ElixirLLM.Tool.define(
  name: "calculator",
  description: "Performs math calculations",
  parameters: %{
    a: [type: :integer, required: true, description: "First number"],
    b: [type: :integer, required: true, description: "Second number"],
    operation: [type: :string, required: true, description: "add, subtract, multiply, divide"]
  },
  execute: fn %{a: a, b: b, operation: op} ->
    case op do
      "add" -> {:ok, "Result: #{a + b}"}
      "subtract" -> {:ok, "Result: #{a - b}"}
      "multiply" -> {:ok, "Result: #{a * b}"}
      "divide" when b != 0 -> {:ok, "Result: #{a / b}"}
      "divide" -> {:error, "cannot divide by zero"}
      other -> {:error, "unknown operation: #{other}"}
    end
  end
)

{:ok, response, _} =
  ElixirLLM.new()
  |> ElixirLLM.tool(calculator)
  |> ElixirLLM.ask("What's 42 * 17?")
# => "42 multiplied by 17 is 714."

Structured Output

Get typed, validated responses with the Schema DSL:

defmodule MyApp.Schemas.Person do
  use ElixirLLM.Schema

  field :name, :string, description: "Full name"
  field :age, :integer, description: "Age in years"
  field :email, :string, required: false

  embeds_one :address do
    field :city, :string
    field :country, :string
  end

  embeds_many :skills, of: :string
end

{:ok, %MyApp.Schemas.Person{} = person, _chat} =
  ElixirLLM.new()
  |> ElixirLLM.schema(MyApp.Schemas.Person)
  |> ElixirLLM.ask("Generate a profile for a senior Elixir developer")

person.name         # => "Alice Chen"
person.age          # => 34
person.address.city # => "San Francisco"
person.skills       # => ["Elixir", "Phoenix", "PostgreSQL"]

Multi-Modal (Images, Audio, PDFs)

# Analyze an image
{:ok, response, _} =
  ElixirLLM.new()
  |> ElixirLLM.model("gpt-4o")
  |> ElixirLLM.ask("What's in this image?", with: ElixirLLM.image("photo.jpg"))

# From URL
{:ok, response, _} =
  ElixirLLM.new()
  |> ElixirLLM.ask("Describe this", with: ElixirLLM.image_url("https://example.com/photo.jpg"))

# PDF document (Claude)
{:ok, response, _} =
  ElixirLLM.new()
  |> ElixirLLM.model("claude-sonnet-4-20250514")
  |> ElixirLLM.ask("Summarize this contract", with: ElixirLLM.pdf("contract.pdf"))

Embeddings

Generate vector representations for semantic search, similarity, and RAG:

# Single text
{:ok, embedding} = ElixirLLM.embed("Elixir is a functional language")
embedding.vector  # => [-0.019, -0.025, 0.018, ...]

# Batch processing
{:ok, embeddings} = ElixirLLM.embed(["Hello", "World", "Elixir"])

# Semantic similarity
alias ElixirLLM.Embedding

{:ok, e1} = ElixirLLM.embed("The cat sat on the mat")
{:ok, e2} = ElixirLLM.embed("A feline rested on a rug")
{:ok, e3} = ElixirLLM.embed("The stock market crashed")

Embedding.cosine_similarity(e1, e2)  # => 0.635 (similar!)
Embedding.cosine_similarity(e1, e3)  # => 0.064 (different)
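
These primitives are enough for a tiny in-memory semantic search. A minimal sketch, assuming the batch form of embed/1 returns embeddings in the same order as its inputs:

docs = ["Elixir runs on the BEAM", "Phoenix is a web framework", "Cats sleep a lot"]

{:ok, doc_embeddings} = ElixirLLM.embed(docs)
{:ok, query} = ElixirLLM.embed("Which library is for the web?")

# Rank documents by similarity to the query and keep the best match
{best_doc, _score} =
  docs
  |> Enum.zip(doc_embeddings)
  |> Enum.map(fn {doc, emb} -> {doc, Embedding.cosine_similarity(query, emb)} end)
  |> Enum.max_by(fn {_doc, score} -> score end)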

Phoenix/Ecto Integration

Persist conversations to your database with one command:

mix elixir_llm.gen.ecto
mix ecto.migrate

This generates a migration for the chats and messages tables, plus the `MyApp.LLM.Chat` and `MyApp.LLM.Message` schema modules.

Using Persisted Chats

# Create a chat
{:ok, chat} = MyApp.LLM.Chat.create(%{model_id: "gpt-4o"})

# Ask questions - automatically persisted
{:ok, response, chat} = MyApp.LLM.Chat.ask(chat, "Hello!", repo: MyApp.Repo)

# Later, load and continue the conversation
chat = MyApp.Repo.get(MyApp.LLM.Chat, id) |> MyApp.Repo.preload(:messages)
{:ok, response, chat} = MyApp.LLM.Chat.ask(chat, "What did we discuss?", repo: MyApp.Repo)

LiveView Streaming

Real-time AI responses in your Phoenix app:

def handle_event("send_message", %{"message" => msg}, socket) do
  chat = socket.assigns.chat
  parent = self()

  Task.start(fn ->
    MyApp.LLM.Chat.ask(chat, msg,
      repo: MyApp.Repo,
      stream: fn chunk ->
        send(parent, {:chunk, chunk})
      end
    )
  end)

  {:noreply, assign(socket, loading: true, response: "")}
end

def handle_info({:chunk, chunk}, socket) do
  new_response = socket.assigns.response <> (chunk.content || "")
  {:noreply, assign(socket, response: new_response)}
end
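
To clear the loading flag when generation finishes, have the Task report back once the call returns. A sketch; the {:done, result} message shape is our own convention, not an ElixirLLM API:

# In the Task above, after MyApp.LLM.Chat.ask/3 returns:
#   send(parent, {:done, result})

def handle_info({:done, {:ok, _response, chat}}, socket) do
  {:noreply, assign(socket, loading: false, chat: chat)}
end

def handle_info({:done, {:error, reason}}, socket) do
  socket = socket |> assign(loading: false) |> put_flash(:error, "LLM error: #{inspect(reason)}")
  {:noreply, socket}
end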

Configuration

# config/config.exs
config :elixir_llm,
  default_model: "gpt-4o"

# config/runtime.exs (recommended for secrets)
config :elixir_llm,
  openai: [
    api_key: System.get_env("OPENAI_API_KEY")
    # base_url: "https://api.openai.com/v1"  # optional, for proxies
  ],

  anthropic: [
    api_key: System.get_env("ANTHROPIC_API_KEY")
  ],

  gemini: [
    api_key: System.get_env("GOOGLE_API_KEY")
  ],

  mistral: [
    api_key: System.get_env("MISTRAL_API_KEY")
  ],

  groq: [
    api_key: System.get_env("GROQ_API_KEY")
  ],

  together: [
    api_key: System.get_env("TOGETHER_API_KEY")
  ],

  openrouter: [
    api_key: System.get_env("OPENROUTER_API_KEY")
  ],

  ollama: [
    base_url: "http://localhost:11434"
  ]

# For Ecto integration
config :elixir_llm, :ecto,
  repo: MyApp.Repo,
  chat_schema: MyApp.LLM.Chat,
  message_schema: MyApp.LLM.Message

Providers

ElixirLLM auto-detects the provider from the model name:

| Provider | Example Models | Features |
| --- | --- | --- |
| OpenAI | gpt-4o, gpt-4.5-preview, o1, o3-mini | Chat, Vision, Tools, Streaming, Embeddings |
| Anthropic | claude-sonnet-4-20250514, claude-opus-4-20250514 | Chat, Vision, PDFs, Tools, Streaming |
| Google Gemini | gemini-2.0-flash, gemini-1.5-pro | Chat, Vision, Tools, Streaming |
| Mistral AI | mistral-large-latest, codestral-latest | Chat, Tools, Streaming |
| Groq | groq/llama-3.3-70b-versatile, groq/llama4-scout | Ultra-fast LPU inference |
| Together AI | together/meta-llama/Llama-3.3-70B | 100+ open models |
| OpenRouter | openrouter/openai/gpt-4o, openrouter/anthropic/claude-3.5-sonnet | 100+ models via a single API |
| Ollama | llama3.2, mistral, codellama | Chat, Tools, Streaming, 100% local |

# Provider is auto-detected from model name
ElixirLLM.new() |> ElixirLLM.model("gpt-4o")                   # => OpenAI
ElixirLLM.new() |> ElixirLLM.model("claude-sonnet-4-20250514") # => Anthropic
ElixirLLM.new() |> ElixirLLM.model("gemini-2.0-flash")         # => Gemini
ElixirLLM.new() |> ElixirLLM.model("mistral-large-latest")     # => Mistral
ElixirLLM.new() |> ElixirLLM.model("groq/llama-3.3-70b-versatile") # => Groq
ElixirLLM.new() |> ElixirLLM.model("together/meta-llama/Llama-3.3-70B") # => Together
ElixirLLM.new() |> ElixirLLM.model("llama3.2")                 # => Ollama

Telemetry

ElixirLLM emits telemetry events for observability, metrics, and debugging:

# Attach a handler
require Logger

:telemetry.attach("llm-logger", [:elixir_llm, :chat, :stop], fn _event, measurements, metadata, _config ->
  Logger.info("#{metadata.model} responded in #{div(measurements.duration, 1_000_000)}ms")
end, nil)

Events

| Event | When |
| --- | --- |
| `[:elixir_llm, :chat, :start \| :stop]` | Chat request lifecycle |
| `[:elixir_llm, :stream, :start \| :stop]` | Streaming request lifecycle |
| `[:elixir_llm, :tool, :call]` | Tool is being called |
| `[:elixir_llm, :tool, :result]` | Tool returned a result |
| `[:elixir_llm, :embed, :start \| :stop]` | Embedding request lifecycle |
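
To observe several events with one handler, :telemetry.attach_many/4 fits nicely. A sketch that logs tool activity (the handler id and log format are arbitrary):

require Logger

:telemetry.attach_many(
  "llm-tool-logger",
  [[:elixir_llm, :tool, :call], [:elixir_llm, :tool, :result]],
  fn event, _measurements, metadata, _config ->
    Logger.info("#{inspect(event)} #{inspect(metadata)}")
  end,
  nil
)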

Resilience Features

ElixirLLM includes built-in resilience patterns for production use.

Retry with Exponential Backoff

Automatically retry failed requests with configurable backoff:

alias ElixirLLM.Retry

# Wrap any operation with retry logic
Retry.with_retry(fn ->
  ElixirLLM.chat("Hello!")
end, max_attempts: 3, base_delay_ms: 1000)

# Options:
#   max_attempts: 3      - Maximum retry attempts
#   base_delay_ms: 1000  - Initial delay between retries
#   max_delay_ms: 30000  - Maximum delay cap
#   jitter: true         - Add randomness to prevent thundering herd

Rate Limiting

Token bucket rate limiter to stay within provider limits:

alias ElixirLLM.RateLimiter

# Check if the request is allowed
case RateLimiter.check_rate(:openai) do
  :ok ->
    ElixirLLM.chat("Hello!")

  {:error, :rate_limited} ->
    # Handle the rate limit (back off, queue, etc.)
    {:error, :rate_limited}
end

# Configure per-provider limits
RateLimiter.configure(:openai, tokens_per_second: 10, bucket_size: 100)
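
A common pattern is to back off briefly and try again when the bucket is empty. A minimal sketch, with an arbitrary sleep interval; combine it with Retry above for transient provider errors:

defmodule MyApp.LimitedChat do
  alias ElixirLLM.RateLimiter

  # Wait until the token bucket allows the call, then send the chat
  def chat(prompt) do
    case RateLimiter.check_rate(:openai) do
      :ok ->
        ElixirLLM.chat(prompt)

      {:error, :rate_limited} ->
        Process.sleep(100)
        chat(prompt)
    end
  end
end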

Circuit Breaker

Prevent cascading failures when a provider is down:

alias ElixirLLM.CircuitBreaker

# Execute with circuit breaker protection
case CircuitBreaker.call(:openai, fn -> ElixirLLM.chat("Hello!") end) do
  {:ok, response} -> response
  {:error, :circuit_open} -> {:error, :circuit_open} # provider is unhealthy, use a fallback
  {:error, reason} -> {:error, reason} # handle other errors
end

# Configure thresholds
CircuitBreaker.configure(:openai,
  failure_threshold: 5,      # Failures before opening circuit
  recovery_timeout_ms: 30000 # Time before attempting recovery
)
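
One way to put the breaker to work is per-provider failover: if the OpenAI circuit is open, route the same prompt elsewhere. A sketch; the fallback model is our own choice:

defmodule MyApp.ResilientChat do
  alias ElixirLLM.CircuitBreaker

  def chat(prompt) do
    case CircuitBreaker.call(:openai, fn -> ElixirLLM.chat(prompt, model: "gpt-4o") end) do
      {:error, :circuit_open} ->
        # OpenAI is unhealthy, fall back to Anthropic
        ElixirLLM.chat(prompt, model: "claude-sonnet-4-20250514")

      result ->
        result
    end
  end
end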

Response Caching

Cache responses to reduce API calls and latency:

alias ElixirLLM.Cache

# Cache a response
Cache.put("cache_key", response, ttl_ms: 300_000)

# Retrieve from cache, fetching a fresh response on a miss
case Cache.get("cache_key") do
  {:ok, cached_response} -> {:ok, cached_response}
  :miss -> ElixirLLM.chat("Hello!")
end

# Configure cache size
Cache.configure(max_size: 1000)  # LRU eviction when exceeded
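
A typical use is a read-through wrapper keyed on a hash of the prompt. A sketch; the key scheme and TTL are our own choices:

defmodule MyApp.CachedChat do
  alias ElixirLLM.Cache

  def chat(prompt) do
    # Derive a stable cache key from the prompt text
    key = Base.encode16(:crypto.hash(:sha256, prompt))

    case Cache.get(key) do
      {:ok, response} ->
        {:ok, response}

      :miss ->
        with {:ok, response} <- ElixirLLM.chat(prompt) do
          Cache.put(key, response, ttl_ms: 300_000)
          {:ok, response}
        end
    end
  end
end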

Error Handling

ElixirLLM provides structured error types for precise error handling:

case ElixirLLM.chat("Hello!") do
  {:ok, response} ->
    response.content

  {:error, %ElixirLLM.RateLimitError{retry_after: seconds}} ->
    Process.sleep(seconds * 1000)
    # Retry...

  {:error, %ElixirLLM.AuthenticationError{}} ->
    Logger.error("Invalid API key")

  {:error, %ElixirLLM.ValidationError{message: msg}} ->
    Logger.error("Invalid request: #{msg}")

  {:error, %ElixirLLM.NetworkError{} = error} ->
    # Retry with backoff
    {:error, error}

  {:error, %ElixirLLM.TimeoutError{} = error} ->
    # Increase the timeout or retry
    {:error, error}

  {:error, %ElixirLLM.ProviderError{provider: provider, message: msg}} ->
    Logger.error("#{provider} error: #{msg}")
end

Error Types

| Error | When |
| --- | --- |
| RateLimitError | API rate limit exceeded (429) |
| AuthenticationError | Invalid or missing API key (401) |
| ValidationError | Invalid request parameters (400) |
| NetworkError | Connection failed |
| TimeoutError | Request timed out |
| ProviderError | Provider-specific error (500, etc.) |
| ToolError | Tool execution failed |
| MaxDepthError | Tool loop exceeded max iterations |

Checking Retryability

alias ElixirLLM.Error.Helpers

case ElixirLLM.chat("Hello!") do
  {:ok, response} ->
    {:ok, response}

  {:error, error} ->
    # retryable?/1 is a regular function, so branch with if/2 rather than a guard
    if Helpers.retryable?(error) do
      # Safe to retry (rate limits, timeouts, network errors)
      Retry.with_retry(fn -> ElixirLLM.chat("Hello!") end)
    else
      # Don't retry (auth errors, validation errors)
      {:error, error}
    end
end

Comparison

| Feature | ElixirLLM | LangChain | ExLLM |
| --- | --- | --- | --- |
| Unified API | Pipe-based | Chain-based | Mixed |
| Tool DSL | `use ElixirLLM.Tool` | Functions | Basic |
| Auto tool loop | Yes | Manual | Manual |
| Schema DSL | Yes | No | Basic |
| Ecto integration | First-class | No | No |
| Streaming | Callback + Stream | Callbacks | Basic |
| Telemetry | Built-in | No | No |
| Multi-modal | Images, PDFs, Audio | Limited | No |

Contributing

We welcome contributions! Here's how to get started:

  1. Fork the repository
  2. Clone your fork: git clone https://github.com/YOUR_USERNAME/elixir_llm.git
  3. Install dependencies: mix deps.get
  4. Run tests: mix test
  5. Create a branch: git checkout -b my-feature
  6. Make your changes
  7. Run the formatter: mix format
  8. Submit a pull request

Development Setup

# Clone and setup
git clone https://github.com/khasinski/elixir_llm.git
cd elixir_llm
mix deps.get

# Run tests (requires API keys in environment)
export OPENAI_API_KEY=your_key
export ANTHROPIC_API_KEY=your_key
mix test

# Generate docs
mix docs

Guidelines


Roadmap

See the CHANGELOG for release history.


License

MIT License. See LICENSE for details.


Acknowledgements

Inspired by the beautiful API design of RubyLLM by @crmne.

Built with love for the Elixir community.


Ready to build something amazing?
mix hex.info elixir_llm