---
title: "Services"
description: "Monitor platform service status and control service lifecycle"
source_url: https://ai-ops.com/docs/system/services
---

# Services

Navigate to **System > Service Status** to view the status of all Koios services and start, stop, or restart them as needed.

---

## Service Tables

Services are divided into two tables, **Application Services** and **Infrastructure**:

### Application Services

| Service | Description |
|---------|-------------|
| **Web App** | Web application server (API, GraphQL) |
| **Celery Worker** | Asynchronous task queue |
| **Data Collector** | Industrial device polling |
| **Predict Engine** | AI model inference |
| **Expression Evaluator** | Calculated tags and user-defined expressions |
| **Parameter Mapping** | Data transformation and normalization |
| **Performance Monitor** | System metrics collection |
| **Component Engine** | Custom component execution |

### Infrastructure

| Service | Description |
|---------|-------------|
| **Configuration Database** | Stores device, tag, user, and model configuration |
| **Time-Series Database** | Stores historical tag values for trends and analysis |
| **In-Memory Cache** | Caches live tag values and status for real-time access |
| **Nginx** | Web proxy and SSL termination |

Each table shows the service name, status, CPU usage, memory usage, process ID (PID), and uptime.

---

## Status Indicators

| Status | Color | Meaning |
|--------|-------|---------|
| **Running** | Teal | Service is healthy and responding |
| **Stopped** | Gray | Service is not running |
| **Failed** | Red | Service crashed or exited with an error |
| **Starting** | Yellow | Service is in the process of starting |
| **Stopping** | Yellow | Service is in the process of stopping |
| **Unresponsive** | Orange | Process is running but not sending healthy heartbeats |
| **Standalone** | Orange | Heartbeats are healthy but the process is not managed by the service manager |

Hover over any status badge for a plain-language explanation.
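
To make the difference between **Running**, **Unresponsive**, and **Standalone** concrete, here is a minimal Python sketch of how a status could be derived from three signals. The names `Status`, `STATUS_COLORS`, and `classify`, and the three boolean inputs, are assumptions made for illustration and are not part of the Koios API. The sketch covers only steady states. **Failed**, **Starting**, and **Stopping** depend on exit and transition information that it does not model.

```python
from enum import Enum


class Status(Enum):
    RUNNING = "Running"
    STOPPED = "Stopped"
    FAILED = "Failed"
    STARTING = "Starting"
    STOPPING = "Stopping"
    UNRESPONSIVE = "Unresponsive"
    STANDALONE = "Standalone"


# Badge colors as listed in the status table.
STATUS_COLORS = {
    Status.RUNNING: "teal",
    Status.STOPPED: "gray",
    Status.FAILED: "red",
    Status.STARTING: "yellow",
    Status.STOPPING: "yellow",
    Status.UNRESPONSIVE: "orange",
    Status.STANDALONE: "orange",
}


def classify(process_running: bool, managed: bool, heartbeat_healthy: bool) -> Status:
    """Hypothetical classifier for steady states only (assumed inputs)."""
    if heartbeat_healthy:
        # Healthy heartbeats from a process the service manager does not
        # control are reported as Standalone rather than Running.
        return Status.RUNNING if managed else Status.STANDALONE
    if process_running:
        # A live process without healthy heartbeats is Unresponsive.
        return Status.UNRESPONSIVE
    # Failed cannot be told apart from Stopped without exit information.
    return Status.STOPPED
```

For example, `classify(True, False, True)` returns `Status.STANDALONE`, which maps to an orange badge in `STATUS_COLORS`.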

---

## Service Detail Drawer

Click any service row to open its detail drawer.

### Overview Tab

- **Description** — what the service does
- **Metrics** — CPU, memory, thread count (application services only), PID, and uptime
- **Last Heartbeat** — when the service last reported in (application services only). Shows as "Last Seen" if the service is stopped.
- **Error Info** — if the service has a reported error, the error message and detail are displayed
- **Service Info** — the system service name, category, and current process state

### Diagnostics Tab

This tab appears only for application services that have diagnostic data available. It shows real-time workload metrics:

- **Pressure** — thread pool utilization as a rolling average. Indicates how busy the service is.
- **In-Flight** — tasks currently executing
- **Queued** — tasks waiting to execute
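
As a rough illustration of what "thread pool utilization as a rolling average" can mean, the sketch below averages the most recent utilization samples over a fixed window. The class name `PressureGauge`, the window size, and the definition of utilization as in-flight tasks divided by pool size are all assumptions. The page does not document how Koios samples or averages this value.

```python
from collections import deque


class PressureGauge:
    """Rolling average of thread pool utilization, as a percentage (sketch)."""

    def __init__(self, pool_size: int, window: int = 10) -> None:
        if pool_size <= 0 or window <= 0:
            raise ValueError("pool_size and window must be positive")
        self.pool_size = pool_size
        # A deque with maxlen discards the oldest sample automatically.
        self.samples = deque(maxlen=window)

    def record(self, in_flight: int) -> None:
        """Store one utilization sample, capped at 100%."""
        utilization = min(in_flight / self.pool_size, 1.0) * 100.0
        self.samples.append(utilization)

    @property
    def pressure(self) -> float:
        """Mean of the samples in the window, or 0.0 when there are none."""
        if not self.samples:
            return 0.0
        return sum(self.samples) / len(self.samples)
```

With `pool_size=4` and `window=2`, recording 2 and then 4 in-flight tasks gives a pressure of 75.0. A third sample of 4 pushes out the oldest sample and raises the pressure to 100.0.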

If the service supports on-demand operations (like the Data Collector or Predict Engine), a separate set of on-demand pressure, in-flight, and queued metrics is also shown.

**Top Contributors** — for the Predict Engine, lists the AI models putting the highest load on the service, with progress bars and contribution percentages. Click a model name to navigate to its detail page.
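
One way to read the contribution percentages is as each model's share of the total load. The sketch below assumes a hypothetical mapping from model name to a numeric load figure. The function name `top_contributors`, the load unit, and the default limit are illustrative and not taken from the product.

```python
def top_contributors(load_by_model: dict[str, float], limit: int = 5) -> list[tuple[str, float]]:
    """Return up to `limit` (model, percent of total load) pairs, highest first."""
    total = sum(load_by_model.values())
    if total <= 0:
        # No load recorded, so there is nothing to rank.
        return []
    ranked = sorted(load_by_model.items(), key=lambda item: item[1], reverse=True)
    return [(name, round(load / total * 100.0, 1)) for name, load in ranked[:limit]]
```

For example, loads of 60, 30, and 10 for three models yield shares of 60.0%, 30.0%, and 10.0%.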

A badge appears on the Diagnostics tab when pressure reaches warning levels (yellow at 70%, red at 90%).
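
The badge thresholds can be expressed as a small function. This is a sketch only: the function and constant names are hypothetical, and treating "reaches" as greater than or equal to the threshold is an assumption.

```python
from typing import Optional

WARNING_THRESHOLD = 70.0   # yellow badge, per the text above
CRITICAL_THRESHOLD = 90.0  # red badge, per the text above


def pressure_badge(pressure_pct: float) -> Optional[str]:
    """Return the badge color for a pressure percentage, or None for no badge."""
    # Check the higher threshold first so 90% and above is never yellow.
    if pressure_pct >= CRITICAL_THRESHOLD:
        return "red"
    if pressure_pct >= WARNING_THRESHOLD:
        return "yellow"
    return None
```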

---

## Service Actions

Action buttons appear in the detail drawer footer based on the service's current state:

| Current State | Available Actions |
|---------------|-------------------|
| **Stopped / Failed** | Start |
| **Running / Unresponsive** | Restart, Stop |
| **Starting / Stopping** | None (wait for transition) |

- **Start** and **Restart** execute immediately
- **Stop** shows a confirmation dialog before proceeding

After any action, the service list refreshes automatically.
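
The state-to-action rules above amount to a lookup table. The following sketch encodes them in Python. The names are hypothetical, and because the table does not list **Standalone**, the sketch returns no actions for it or for any other unlisted state, which is an assumption rather than documented behavior.

```python
# Actions per state, copied from the table above.
AVAILABLE_ACTIONS = {
    "Stopped": ("Start",),
    "Failed": ("Start",),
    "Running": ("Restart", "Stop"),
    "Unresponsive": ("Restart", "Stop"),
    "Starting": (),  # wait for the transition to finish
    "Stopping": (),  # wait for the transition to finish
}

# Only Stop shows a confirmation dialog; Start and Restart run immediately.
NEEDS_CONFIRMATION = {"Stop"}


def available_actions(state: str) -> tuple:
    """Return the actions offered for a state; unlisted states get none (assumption)."""
    return AVAILABLE_ACTIONS.get(state, ())


def requires_confirmation(action: str) -> bool:
    """Return True when the action prompts before it runs."""
    return action in NEEDS_CONFIRMATION
```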

---

## Service Diagnostics Settings

Below the service tables, a **Service Diagnostics** section lets you configure advanced metric recording for application services. These settings control whether pressure, queue depth, and top contributor data are collected and made available in the detail drawer's Diagnostics tab.

---

## What's Next

- [System Health](https://ai-ops.com/docs/system/performance.md) — CPU, memory, disk, and network monitoring with alarms
- [Logs](https://ai-ops.com/docs/system/logs.md) — stream real-time logs for any service
