igmarin/elixir-phoenix-skills

Curated library of 38 atomic skills, 7 personas, and 1 orchestrator for Elixir and Phoenix development. Organized by category: fundamentals, phoenix, database, testing, auth, infrastructure, quality, security, integrations, tooling, frameworks, personas, and orchestration. Covers core Elixir patterns, Phoenix LiveView, Ecto, OTP, Oban, testing, security, deployment, real-time, and modern tooling (Req, Swoosh, Cachex, Broadway, Ash).

Quality

91%

Does it follow best practices?

Impact

—

No eval scenarios have been run

Securityby

Advisory

Suggest reviewing before use

name:: broadway-data-pipelines
type:: atomic
tags:: atomic
license:: MIT
description:: MANDATORY when building data processing pipelines or consuming message queues. Invoke before implementing GenStage or Broadway consumers. Covers Broadway setup, producers, processors, batchers, and error handling. Trigger words: Broadway, GenStage, data pipeline, message queue, consumer, producer, batcher, SQS, Kafka, RabbitMQ, broadway_sqs, broadway_kafka, handle_message, handle_batch, handle_failed, Broadway.start_link, Broadway.Message, push_message, dead letter queue, DLQ.
metadata:: {"user-invocable":"true","version":"1.0.0"}

Broadway Data Pipelines

Name: igmarin/elixir-phoenix-skills
Rating: 73.48 (1 reviews)
Author: igmarin

RULES — Follow these with no exceptions

Use Broadway.Message.failed/2 for errors — never raise in handle_message/3
Implement handle_failed/2 — dead-letter handling must be explicit for every pipeline
Use batchers for database inserts — don't insert one-by-one; batch size of 100 is a good default
Configure :status in start_link — set :max_restarts, :max_seconds for production resilience
Test with Broadway.Test.push_message/2 — verify each message type including failures
Wire telemetry — attach handlers to Broadway's telemetry events for observability

End-to-End Setup Workflow

Follow these steps in order when building a new Broadway pipeline:

Add dependencies — add broadway (and any producer library) to mix.exs
Define the pipeline module — implement handle_message/3 and handle_batch/4 callbacks
Add to supervision tree — include the pipeline module in application.ex
Verify producer connectivity — confirm the producer connects on startup
Test with a single message — use Broadway.Test helpers before scaling concurrency
Scale concurrency — tune processor and batcher concurrency based on CPU cores and throughput targets
Validate error handling — intentionally send a bad message and confirm handle_failed/2 fires
Enable observability — wire up telemetry events and optionally add broadway_dashboard; see Broadway Telemetry docs and broadway_dashboard

Producer libraries: For SQS use broadway_sqs, for Kafka use broadway_kafka, for RabbitMQ use broadway_rabbitmq. See each library's README for producer-specific configuration.

Setup

# mix.exs
defp deps do
  [
    {:broadway, "~> 1.0"},
    {:broadway_dashboard, "~> 0.3"}  # Optional: LiveDashboard integration
  ]
end

Production-Ready Pipeline

defmodule MyApp.MessagePipeline do
  use Broadway

  def start_link(_opts) do
    Broadway.start_link(__MODULE__,
      name: __MODULE__,
      producer: [
        module: {BroadwaySQS.Producer, queue_url: System.get_env("SQS_QUEUE_URL")}
      ],
      processors: [
        default: [concurrency: 10]
      ],
      batchers: [
        default: [concurrency: 5, batch_size: 100, batch_timeout: 2000]
      ]
    )
  end

  @impl true
  def handle_message(_, message, _context) do
    case process(message.data) do
      {:ok, result} ->
        message
        |> Broadway.Message.update_data(fn _ -> result end)
        |> Broadway.Message.put_batcher(:default)

      {:error, reason} ->
        Broadway.Message.failed(message, reason)
    end
  end

  @impl true
  def handle_failed(messages, _context) do
    Enum.each(messages, fn message ->
      Logger.error("Message failed: #{inspect(message.data)}")
      DeadLetterQueue.send(message.data, message.status.reason)
    end)
    messages
  end

  @impl true
  def handle_batch(:default, messages, _batch_info, _context) do
    data = Enum.map(messages, & &1.data)
    case MyApp.Repo.insert_all(MyApp.Record, data) do
      {_count, _} ->
        messages

      {:error, reason} ->
        Logger.error("Batch failed: #{inspect(reason)}")
        Enum.map(messages, &Broadway.Message.failed(&1, reason))
    end
  end

  defp process(%{"body" => body} = data) do
    sanitized = %{data | "body" => String.slice(body || "", 0, 10_000)}
    {:ok, Map.put(sanitized, :processed_at, DateTime.utc_now())}
  end
  defp process(data) when is_map(data) do
    {:ok, Map.put(data, :processed_at, DateTime.utc_now())}
  end
  defp process(data) when is_binary(data) do
    case Jason.decode(data) do
      {:ok, parsed} -> process(parsed)
      {:error, _} -> {:error, :invalid_json}
    end
  end
end

Supervision Tree

# lib/my_app/application.ex
def start(_type, _args) do
  children = [
    # ...
    MyApp.MessagePipeline
  ]

  Supervisor.start_link(children, strategy: :one_for_one)
end

Testing

defmodule MyApp.MessagePipelineTest do
  use ExUnit.Case
  import Broadway.Test

  test "processes a single message" do
    ref = push_message(MyApp.MessagePipeline, %{id: 1, value: "hello"})
    assert_receive {:ack, ^ref, [%{data: %{id: 1}}], []}
  end

  test "marks malformed messages as failed" do
    ref = push_message(MyApp.MessagePipeline, nil)
    assert_receive {:ack, ^ref, [], [_failed]}
  end
end

Retry Strategies

Broadway does not provide a built-in backoff/requeue mechanism at the message level. For retry logic:

Short-term retries: wrap process/1 in a retry library (e.g., Retry) and let failures bubble to handle_failed/2.
Long-term retries / exponential backoff: send failed messages to a dead-letter queue and re-enqueue from there, or use producer-level redelivery mechanisms.

@impl true
def handle_message(_, message, _context) do
  attempt = Map.get(message.metadata, :retry_count, 0)

  case process(message.data) do
    {:ok, result} ->
      Broadway.Message.update_data(message, fn _ -> result end)

    {:error, reason} when attempt < 3 ->
      Broadway.Message.failed(message, {:retryable, reason})

    {:error, reason} ->
      Broadway.Message.failed(message, {:max_retries_exceeded, reason})
  end
end

Producer Configurations

SQS

Broadway.start_link(__MODULE__,
  name: __MODULE__,
  producer: [
    module: {
      BroadwaySQS.Producer,
      queue_url: System.get_env("SQS_QUEUE_URL"),
      config: [
        region: "us-west-2",
        max_number_of_messages: 10,
        wait_time_seconds: 20
      ]
    },
    concurrency: 1
  ],
  processors: [
    default: [
      concurrency: 10,
      max_demand: 10,
      min_demand: 5
    ]
  ],
  batchers: [
    default: [
      concurrency: 5,
      batch_size: 100,
      batch_timeout: 5_000
    ]
  ]
)

Kafka

Broadway.start_link(__MODULE__,
  name: __MODULE__,
  producer: [
    module: {
      BroadwayKafka.Producer,
      brokers: ["localhost:9092"],
      group_id: "my_consumer_group",
      topics: ["my-topic"]
    }
  ],
  processors: [
    default: [concurrency: 10]
  ],
  batchers: [
    default: [concurrency: 5, batch_size: 100, batch_timeout: 5_000]
  ]
)

Telemetry

Broadway emits telemetry events for message processing and batching. Attach handlers via :telemetry.attach_many/4 in your application startup:

:telemetry.attach_many(
  "broadway-handler",
  [
    [:broadway, :message, :start],
    [:broadway, :message, :stop],
    [:broadway, :message, :failure],
    [:broadway, :batch, :start],
    [:broadway, :batch, :stop]
  ],
  &MyApp.Telemetry.handle_event/4,
  %{}
)

Optionally visualise metrics with broadway_dashboard. See the Broadway Telemetry guide for full event names and metadata shapes.

Concurrency Tuning

# CPU-bound: fewer workers, lower demand
# I/O-bound: more workers, higher demand
processors: [
  default: [
    concurrency: System.schedulers_online() * 2,  # multiply by 4 for heavy I/O
    max_demand: 50                                 # raise to 100 for I/O-bound
  ]
]

batchers: [
  default: [
    concurrency: System.schedulers_online(),
    batch_size: 100,
    batch_timeout: 5_000
  ]
]

.tessl-plugin

skills

auth

database

frameworks

fundamentals

infrastructure

broadway-data-pipelines

SKILL.md

cachex-caching

deployment-gotchas

oban-essentials

telemetry-essentials

integrations

orchestration

personas

phoenix

quality

security

testing

tooling

README.md

tile.json