Metering

Uses: Metering & Billing Konnect API

Use Case	Description
AI Tokens	Metering AI Tokens consumed by LLMs and AI agents.
API Requests	Metering API Requests count and duration.
Compute	Metering runtime of VMs, CPUs, GPUs, etc.
Seats	Metering number of unique users over sessions.

Aggregation types

Aggregation types determine how usage data is aggregated for generic meters.

You can configure the following aggregation types for generic meters:

Aggregation Type	Description
COUNT	The `COUNT` aggregation type counts the number of events that occur within a specific time window. It is often used for metrics that are inherently countable, such as the number of transactions processed or API calls made. `COUNT` meters don’t take a `valueProperty`.
SUM	The `SUM` aggregation type calculates the total of the metered values within a specific time window. `SUM` aggregates over the event’s `valueProperty`. This is useful for accumulating metrics like total LLM tokens used, total data transferred, or total time spent on a service.
UNIQUE_COUNT	The `UNIQUE_COUNT` aggregation type counts the number of unique events. This is useful when events are unique by a specific field. The `valueProperty` defines the field that makes the ingested event unique. The property’s value in the ingested event must be a string or number.
LATEST	The `LATEST` aggregation type returns the last value reported within a specific time window. This is useful when you track the size of a resource yourself and periodically report its value to Metering & Billing, for example disk size, number of resources, or seats.
MIN	The `MIN` aggregation type identifies the minimum value among the metered data points within a specific time window. This is useful for metrics where the lowest value is of interest, such as minimum available storage or minimum response time.
MAX	The `MAX` aggregation type identifies the maximum value among the metered data points within a specific time window. This is useful for metrics where the highest value is of interest, such as maximum load on a server or maximum transaction value.
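
As a rough illustration of the table above, the following Python sketch applies each aggregation type to the events of one time window. It is not how Metering & Billing computes aggregates internally: the `aggregate` function is hypothetical, events are reduced to their `data` dictionaries, and `value_property` is a plain key rather than a JSONPath.

```python
from decimal import Decimal


def aggregate(events, aggregation, value_property=None):
    """Aggregate the `data` dicts of one time window (illustrative only)."""
    if aggregation == "COUNT":
        # COUNT ignores the value property and counts events.
        return len(events)
    values = [event[value_property] for event in events]
    if aggregation == "UNIQUE_COUNT":
        # Values may be strings or numbers; only distinctness matters.
        return len(set(values))
    # The remaining types need numeric values; Decimal keeps precision
    # when values arrive as quoted strings.
    numbers = [Decimal(str(value)) for value in values]
    if aggregation == "SUM":
        return sum(numbers, Decimal(0))
    if aggregation == "MIN":
        return min(numbers)
    if aggregation == "MAX":
        return max(numbers)
    if aggregation == "LATEST":
        # Assumes `events` is already ordered by event time.
        return numbers[-1]
    raise ValueError(f"unknown aggregation: {aggregation}")


window = [
    {"tokens": "10", "session": "a"},
    {"tokens": "5", "session": "b"},
    {"tokens": "7", "session": "a"},
]
print(aggregate(window, "SUM", "tokens"))
print(aggregate(window, "UNIQUE_COUNT", "session"))
```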

Event ingestion

Metering & Billing ingests Konnect API Gateway and LLM events automatically when they’re enabled. If you want to configure generic meters, you must use the CloudEvents format for event ingestion.

As CloudEvents is generic, here are some best practices for defining events in OpenMeter:

Name	Description	Examples
Subject (API Property: `event.subject`)	Subjects in OpenMeter are entities that consume resources you wish to meter. These can range from users, servers, and services to devices. The design of subjects is intentionally generic, enabling flexible application across various metering scenarios. Typically, a subject acts as a unique identifier within your system for a user or customer.	Customer ID or User ID; Hostname or IP address; Service or Application name; Device ID
Source (API Property: `event.source`)	The event’s source (for example, the service name). As events are unique by `id` and `source`, set different sources if you report the same transaction in multiple applications.	`my-service-name`, `my-application-name`
Event ID (API Property: `event.id`)	Events are unique by their `id` and `source` properties for idempotency, and OpenMeter deduplicates events on that pair. Therefore, it is important to pick an ID that makes the event unique and resilient to retries. For example, when metering an API call, this can be the request ID. You can generate a new UUID if your application doesn’t have a unique identifier.	HTTP Request ID, typically in headers: `Request-ID`, `X-Request-ID`; LLM Chat Completion ID: `id` field in ChatGPT response; Workflow ID: like activity ID in Temporal; Generated UUID (Node.js, Python, Go)
Data (API Property: `event.data`)	OpenMeter uses the CloudEvents `data` property to ingest values and group-by fields. Always include everything the meter requires in `data`, such as the value property and the group-by properties.	Always include the value property for non-`COUNT` aggregations. Always include group-by properties. Quote numbers as strings to preserve precision, like `"123"`.
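
The practices in the table can be sketched in Python as a small event builder. This is a minimal sketch, not an official client: `build_event` and `quote_numbers` are hypothetical helper names, and the code only constructs the CloudEvents payload without sending it anywhere.

```python
import json
import uuid
from datetime import datetime, timezone


def quote_numbers(data):
    """Return a copy of `data` with int/float values turned into strings."""
    return {
        key: str(value)
        if isinstance(value, (int, float)) and not isinstance(value, bool)
        else value
        for key, value in data.items()
    }


def build_event(event_type, source, subject, data, event_id=None):
    """Build a CloudEvents 1.0 dictionary for a usage event."""
    return {
        "specversion": "1.0",
        "type": event_type,
        # Prefer a stable ID such as a request ID so retries deduplicate;
        # fall back to a random UUID when none is available.
        "id": event_id or str(uuid.uuid4()),
        "time": datetime.now(timezone.utc).isoformat(),
        "source": source,
        "subject": subject,
        "data": quote_numbers(data),
    }


event = build_event(
    "request",
    "api-service",
    "customer-1",
    {"method": "GET", "duration_seconds": 12345},
    event_id="req-1",
)
print(json.dumps(event, indent=2))
```

Passing the request ID as `event_id` means a retried report carries the same `id` and `source`, so it is deduplicated rather than counted twice.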

Create a meter

To configure a meter in Konnect, do the following:

To meter Kong Gateway API requests, you need traffic to a Gateway Service and Route.

In the Konnect sidebar, click Metering & Billing.
Enable Gateway.
Select a Gateway.
Click Enable Gateway.

To meter Kong AI Gateway LLM token usage, you must have the AI Proxy plugin configured.

In the Konnect sidebar, click Metering & Billing.
Enable AI Gateway Tokens.

The kong_konnect_llm_tokens meter then appears in the list of available meters.

Example meter use cases

The following examples show how you can configure meters and usage events for common use cases.

LLM token usage

If you want to meter Kong AI Gateway LLM token usage, you can enable the built-in integration to meter usage in one click.

In most cases, AI applications count token usage for billing or cost control. Because a single AI interaction consumes many tokens, define the generic meter with the SUM aggregation and report token usage in the tokens property of the event data. Most LLM providers charge differently for input, output, and system prompts, and prices also vary by model, so it makes sense to group by model and prompt type.

meters:
  - slug: tokens_total
    description: AI Token Usage
    # Filter events by type
    eventType: prompt
    aggregation: SUM
    # JSONPath to parse usage value
    valueProperty: $.tokens
    groupBy:
      # Model used: gpt4-turbo, etc.
      model: $.model
      # Prompt type: input, output, system
      type: $.type

{
  "specversion": "1.0",
  "type": "prompt",
  "id": "00001",
  "time": "2024-01-01T00:00:00.001Z",
  "source": "chat-app",
  "subject": "customer-1",
  "data": {
    "tokens": "123456",
    "model": "gpt4-turbo",
    "type": "output"
  }
}
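
To make the effect of the group by concrete, the following Python sketch sums the tokens property per (model, type) pair across a few events. The `sum_by_group` function is hypothetical and only mimics what a SUM meter with these group-by fields would report; field names are plain keys rather than JSONPath expressions.

```python
from collections import defaultdict
from decimal import Decimal


def sum_by_group(events, value_property, group_by):
    """Sum `value_property` per combination of the `group_by` fields."""
    totals = defaultdict(Decimal)  # Decimal() starts each group at 0
    for event in events:
        data = event["data"]
        key = tuple(data[name] for name in group_by)
        totals[key] += Decimal(data[value_property])
    return dict(totals)


events = [
    {"data": {"tokens": "123456", "model": "gpt4-turbo", "type": "output"}},
    {"data": {"tokens": "1000", "model": "gpt4-turbo", "type": "input"}},
    {"data": {"tokens": "44", "model": "gpt4-turbo", "type": "output"}},
]
totals = sum_by_group(events, "tokens", ["model", "type"])
for group, tokens in totals.items():
    print(group, tokens)
```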

GPU time

Metering GPU time is common for customer billing, internal chargeback, and cost control.

meters:
  - slug: gpu_execution_duration_seconds
    description: GPU Time
    eventType: gpu_time
    aggregation: SUM
    valueProperty: $.duration_seconds
    groupBy:
      hostname: $.hostname
      region: $.region
      # Type of GPU: e.g. nvidia_A100
      gpu_type: $.gpu_type

{
  "specversion": "1.0",
  "type": "gpu_time",
  "id": "00001",
  "time": "2024-01-01T00:00:00.001Z",
  "source": "my-image-generator",
  "subject": "customer-1",
  "data": {
    "duration_seconds": "12345",
    "hostname": "my-hostname",
    "region": "us-east-1",
    "gpu_type": "nvidia_A100"
  }
}

API request count

If you want to meter Kong Gateway API requests, you can enable the built-in integration to meter usage in one click.

Products that monetize API usage may want to count the number of requests. With the COUNT aggregation, each event increases the meter by one. For grouping, add the method and route. Note that the event reports the route template rather than the actual HTTP path, so that IDs and dynamic route segments don’t split usage across many groups.

meters:
  - slug: api_requests_total
    description: API Requests
    # Filter events by type
    eventType: request
    aggregation: COUNT
    groupBy:
      # HTTP Method: GET, POST, etc.
      method: $.method
      # Route: /products/:product_id
      route: $.route

{
  "specversion": "1.0",
  "type": "request",
  "id": "00001",
  "time": "2024-01-01T00:00:00.001Z",
  "source": "api-service",
  "subject": "customer-1",
  "data": {
    "method": "GET",
    "route": "/products/:product_id"
  }
}
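
If your web framework doesn’t expose the matched route template, one way to derive it is to match the request path against a list of known templates. The Python sketch below assumes a hypothetical `ROUTE_TEMPLATES` list using the `:param` style from the example above, and expects a path without a query string.

```python
import re

# Hypothetical list of route templates served by the application.
ROUTE_TEMPLATES = [
    "/products/:product_id",
    "/products/:product_id/reviews/:review_id",
]


def _template_to_regex(template):
    """Turn '/products/:product_id' into a regex matching concrete paths."""
    parts = []
    for segment in template.strip("/").split("/"):
        # ':name' segments match any single path segment.
        parts.append("[^/]+" if segment.startswith(":") else re.escape(segment))
    return re.compile("^/" + "/".join(parts) + "/?$")


_COMPILED = [(template, _template_to_regex(template)) for template in ROUTE_TEMPLATES]


def route_template(path):
    """Return the template matching `path`, or None if nothing matches."""
    for template, pattern in _COMPILED:
        if pattern.match(path):
            return template
    return None


print(route_template("/products/42"))
```

Reporting the returned template in the event’s route property keeps `/products/42` and `/products/43` in the same group.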

API request duration

As with the API request count, you can track the request duration. This is essentially how serverless products like AWS Lambda charge their customers. To track both the request count and the duration from the same event, see the “Moving multiple meters with one event” example.

meters:
  - slug: api_request_duration
    description: API Request Duration
    # Filter events by type
    eventType: request
    aggregation: SUM
    # JSONPath to parse duration value
    valueProperty: $.duration_seconds
    groupBy:
      # HTTP Method: GET, POST, etc.
      method: $.method
      # Route: /products/:product_id
      route: $.route

{
  "specversion": "1.0",
  "type": "request",
  "id": "00001",
  "time": "2024-01-01T00:00:00.001Z",
  "source": "api-service",
  "subject": "customer-1",
  "data": {
    "method": "GET",
    "route": "/products/:product_id",
    "duration_seconds": "12345"
  }
}

Kubernetes pod execution duration

To track Kubernetes pod execution duration, use our native Kubernetes collector that already reports usage events in this format.

meters:
  - slug: pod_execution_time
    description: Pod Execution Time
    eventType: kube-pod-exec-time
    aggregation: SUM
    valueProperty: $.duration_seconds
    groupBy:
      pod_name: $.pod_name
      pod_namespace: $.pod_namespace

{
  "specversion": "1.0",
  "type": "kube-pod-exec-time",
  "id": "f7f11c82-c415-4325-aec7-572606681817",
  "time": "2024-01-01T00:00:00.001Z",
  "source": "my-app",
  "subject": "customer-1",
  "data": {
    "duration_seconds": "123",
    "pod_name": "pod_name",
    "pod_namespace": "pod_namespace"
  }
}

Counting unique events

In some cases, you may want to count unique events, such as unique sessions. To achieve this, you can use the UNIQUE_COUNT aggregation.

meters:
  - slug: unique_sessions_total
    description: Unique Sessions
    eventType: login
    aggregation: UNIQUE_COUNT
    valueProperty: $.session_id

{
  "specversion": "1.0",
  "type": "login",
  "id": "00001",
  "time": "2024-01-01T00:00:00.001Z",
  "source": "auth-service",
  "subject": "customer-1",
  "data": {
    "session_id": "session_id"
  }
}

Moving multiple meters with one event

In OpenMeter, a single event can move multiple meters if the event type matches. Let’s see an example of tracking an API request’s occurrence, execution duration, and network usage.

meters:
  - slug: api_requests_total
    description: API Requests
    eventType: request
    aggregation: COUNT
    groupBy:
      method: $.method
      route: $.route
  - slug: api_request_duration_seconds
    description: API Request Duration
    eventType: request
    aggregation: SUM
    valueProperty: $.duration_seconds
    groupBy:
      method: $.method
      route: $.route
  - slug: api_request_ingress_bytes
    description: Request Ingress Bytes
    eventType: request
    aggregation: SUM
    valueProperty: $.ingress_bytes
    groupBy:
      method: $.method
      route: $.route

{
  "specversion": "1.0",
  "type": "request",
  "id": "00001",
  "time": "2024-01-01T00:00:00.001Z",
  "source": "api-service",
  "subject": "customer-1",
  "data": {
    "method": "GET",
    "route": "/products/:product_id",
    "duration_seconds": "123",
    "ingress_bytes": "456",
    "egress_bytes": "789"
  }
}
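
The following Python sketch illustrates how one event can move several meters. It is a simplified model, not the service’s implementation: the `METERS` list mirrors the configuration above with plain keys instead of JSONPath expressions, and `increments` is a hypothetical helper that returns how much each matching meter would grow.

```python
from decimal import Decimal

# Simplified copy of the meter configuration above.
METERS = [
    {"slug": "api_requests_total", "eventType": "request", "aggregation": "COUNT"},
    {
        "slug": "api_request_duration_seconds",
        "eventType": "request",
        "aggregation": "SUM",
        "valueProperty": "duration_seconds",
    },
    {
        "slug": "api_request_ingress_bytes",
        "eventType": "request",
        "aggregation": "SUM",
        "valueProperty": "ingress_bytes",
    },
]


def increments(event, meters=METERS):
    """Return {slug: increment} for every meter whose eventType matches."""
    result = {}
    for meter in meters:
        if meter["eventType"] != event["type"]:
            continue
        if meter["aggregation"] == "COUNT":
            result[meter["slug"]] = Decimal(1)
        else:
            result[meter["slug"]] = Decimal(event["data"][meter["valueProperty"]])
    return result


event = {
    "type": "request",
    "data": {
        "method": "GET",
        "route": "/products/:product_id",
        "duration_seconds": "123",
        "ingress_bytes": "456",
        "egress_bytes": "789",
    },
}
print(increments(event))
```

The egress_bytes property is ignored here because no meter in the configuration reads it.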

Counting state changes

In some cases, you want to count how many states a workflow or task goes through as it progresses, for example from created to in_progress to success. If you report a usage event for every state change and track the state as a group by, answering simple questions such as how many workflows ran in total always requires filtering by a state like created, which is easy to forget and error-prone.

The recommended way to model states is to create separate meters per state.

meters:
  - slug: workflow_created
    description: Workflows Created
    eventType: workflow_create
    aggregation: COUNT
    groupBy:
      task_type: $.task_type
  - slug: workflow_succeeded
    description: Workflow Succeeded
    eventType: workflow_success
    aggregation: COUNT
    groupBy:
      task_type: $.task_type
  - slug: workflow_failed
    description: Workflow Failed
    eventType: workflow_fail
    aggregation: COUNT
    groupBy:
      task_type: $.task_type

[
  {
    "specversion": "1.0",
    "type": "workflow_create",
    "id": "00001",
    "time": "2024-01-01T00:00:00.001Z",
    "source": "task-queue",
    "subject": "task-1",
    "data": {
      "task_type": "image-generate"
    }
  },
  {
    "specversion": "1.0",
    "type": "workflow_success",
    "id": "00002",
    "time": "2024-01-01T00:00:00.001Z",
    "source": "task-queue",
    "subject": "task-1",
    "data": {
      "task_type": "image-generate"
    }
  }
]
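
Because events are deduplicated by `id` and `source`, each state change needs its own ID, while a retry of the same state change should reuse it. One possible approach, sketched in Python below, derives the ID from the workflow ID and the state. The `state_change_event` and `deduplicate` helpers and the ID format are assumptions for illustration, and `deduplicate` only imitates the deduplication rule described earlier.

```python
def state_change_event(workflow_id, state, task_type, time):
    """Build one event per state change with a retry-safe ID."""
    return {
        "specversion": "1.0",
        "type": f"workflow_{state}",  # e.g. workflow_create, workflow_success
        "id": f"{workflow_id}:{state}",  # same ID on retry, new ID per state
        "time": time,
        "source": "task-queue",
        "subject": workflow_id,
        "data": {"task_type": task_type},
    }


def deduplicate(events):
    """Keep the first event for each (id, source) pair."""
    seen = set()
    unique = []
    for event in events:
        key = (event["id"], event["source"])
        if key not in seen:
            seen.add(key)
            unique.append(event)
    return unique


created = state_change_event("task-1", "create", "image-generate", "2024-01-01T00:00:00.001Z")
succeeded = state_change_event("task-1", "success", "image-generate", "2024-01-01T00:00:05.000Z")
retried = state_change_event("task-1", "success", "image-generate", "2024-01-01T00:00:05.000Z")
print([event["id"] for event in deduplicate([created, succeeded, retried])])
```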

Translate AI demo

The following example meters are for an imaginary Translate AI product that translates PDF documents between languages. For example, you can use an LLM like GPT-4 to translate a PDF document from German to English. For this use case, you want to track the number of pages, words, and LLM tokens used for each translation.

Meter to count the number of pages translated:

meters:
  - slug: pages_total
    description: Number of pages translated
    eventType: translate
    aggregation: SUM
    valueProperty: $.pages

Meter to count the number of words translated:

meters:
  - slug: words_total
    description: Number of words translated
    eventType: translate
    aggregation: SUM
    valueProperty: $.words

Meter to count the number of LLM tokens used:

meters:
  - slug: tokens_total
    description: Number of LLM tokens used
    eventType: translate
    aggregation: SUM
    valueProperty: $.tokens
    groupBy:
      model: $.model

{
  "specversion": "1.0",
  "id": "0b8bb322-90c2-4b46-b541-0380bac1b9a5",
  "source": "myapp",
  "type": "translate",
  "subject": "customer-123",
  "datacontenttype": "application/json",
  "time": "2025-02-18T16:37:44Z",
  "data": {
    "model": "gpt-4",
    "pages": 23,
    "tokens": 10200,
    "words": 6912
  }
}
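
When several meters read from the same event, it can help to check before sending that the event carries every property those meters need. The Python sketch below does this for the three Translate AI meters. The `missing_fields` helper is hypothetical and only understands simple `$.field` paths, not full JSONPath.

```python
METERS = [
    {"slug": "pages_total", "aggregation": "SUM", "valueProperty": "$.pages"},
    {"slug": "words_total", "aggregation": "SUM", "valueProperty": "$.words"},
    {
        "slug": "tokens_total",
        "aggregation": "SUM",
        "valueProperty": "$.tokens",
        "groupBy": {"model": "$.model"},
    },
]


def _field(path):
    """Strip the leading '$.' from a simple JSONPath like '$.pages'."""
    return path[2:] if path.startswith("$.") else path


def missing_fields(meter, event):
    """Return the paths the meter needs that are absent from event data."""
    required = []
    if meter["aggregation"] != "COUNT":
        required.append(meter["valueProperty"])
    required.extend(meter.get("groupBy", {}).values())
    data = event.get("data", {})
    return [path for path in required if _field(path) not in data]


event = {
    "type": "translate",
    "data": {"model": "gpt-4", "pages": 23, "tokens": 10200, "words": 6912},
}
for meter in METERS:
    print(meter["slug"], missing_fields(meter, event))
```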

FAQs

Do I need to bill or create plans for my meters?

No, you can use metering on its own to track customer usage.
