Agent Skills

The official Agent Skills for [ClickHouse](https://clickhouse.com/). These skills help LLMs and agents to adopt best practices when working with

Source: ClickHouse/agent-skills

What Is This?

Overview

ClickHouse Agent Skills is an official collection of structured skill definitions designed to guide large language models and AI agents when working with ClickHouse, the open-source column-oriented database management system. These skills encode best practices, query patterns, and operational knowledge into a format that LLM-powered tools can consume and apply during code generation, query writing, and database administration tasks.

The skill set originates from the official ClickHouse GitHub organization, meaning the guidance it provides reflects the recommendations of the ClickHouse engineering team. Rather than relying on generic database knowledge that an LLM might have absorbed during training, these agent skills supply current, accurate, and ClickHouse-specific context that improves the quality of AI-assisted work.

As AI coding assistants and autonomous agents become more common in data engineering workflows, having a reliable skill layer between the model and the database becomes increasingly important. ClickHouse Agent Skills fills that role by standardizing how agents reason about schema design, query optimization, data ingestion, and cluster management within the ClickHouse ecosystem.

Who Should Use This

Data engineers who use AI coding assistants to write and review ClickHouse queries
Backend developers integrating ClickHouse into applications with the help of LLM tools
DevOps and platform engineers managing ClickHouse clusters who want AI-assisted operational guidance
Analytics engineers building data pipelines and materialized views in ClickHouse environments
AI agent developers building autonomous systems that interact with ClickHouse programmatically
Teams adopting ClickHouse for the first time who want AI guidance grounded in official best practices

Why Use It?

Problems It Solves

Generic LLMs often produce ClickHouse queries that ignore engine-specific optimizations, leading to slow or incorrect results
Agents writing DDL statements without ClickHouse-specific knowledge may choose inappropriate table engines or partition strategies
Without structured skill context, AI tools may suggest PostgreSQL or MySQL patterns that do not translate well to ClickHouse semantics
Developers new to ClickHouse waste time debugging AI-generated code that violates ClickHouse constraints around primary keys, sorting keys, and data types

Core Highlights

Official source from the ClickHouse organization, ensuring accuracy and alignment with current versions
Covers query writing, schema design, table engine selection, and ingestion patterns
Designed for integration with LLM agent frameworks and AI coding assistants
Encodes ClickHouse-specific idioms such as ReplacingMergeTree deduplication and materialized view chaining
Reduces hallucination risk by supplying grounded, domain-specific context to the model
Applicable across ClickHouse Cloud and self-hosted deployments
Supports both analytical query patterns and operational administration tasks

How to Use It?

Basic Usage

To apply these skills in an agent or assistant context, reference the skill definitions from the source repository and supply them as system context or tool descriptions when initializing your LLM session.

git clone https://github.com/ClickHouse/agent-skills.git

Once cloned, you can load individual skill files and inject them into your agent prompt or tool configuration depending on your framework.

with open("agent-skills/query_optimization.md", "r") as f:
    skill_context = f.read()

system_prompt = f"You are a ClickHouse expert. Apply these skills:\n{skill_context}"

Specific Scenarios

Schema Design Assistance: When asking an agent to design a table for time-series event data, the skills guide the model to recommend MergeTree with an appropriate ORDER BY clause and a time-based partition key rather than a generic row-store approach.

Query Review: An agent equipped with these skills can review a submitted query and flag missing PREWHERE clauses, suboptimal JOIN strategies, or unnecessary full-table scans.

Real-World Examples

A data engineering team building a clickstream analytics pipeline uses the skills to have their AI assistant generate correct INSERT statements with proper batch sizing and async insert settings enabled.

A developer building a SaaS application uses the skills to get accurate guidance on using ReplacingMergeTree for upsert-like behavior, including the correct use of the FINAL keyword.

When to Use It?

Use Cases

AI-assisted ClickHouse query generation in development environments
Automated code review pipelines for ClickHouse DDL and DML statements
Onboarding new engineers to ClickHouse with AI tutoring grounded in official guidance
Building autonomous data agents that manage ClickHouse tables and ingestion jobs
Generating ClickHouse migration scripts with correct syntax and engine choices
Validating AI-generated ClickHouse configurations before deployment

Agent Skills

What Is This?

Overview

Who Should Use This

Why Use It?

Problems It Solves

Core Highlights

How to Use It?

Basic Usage

Specific Scenarios

Real-World Examples

When to Use It?

Use Cases

Important Notes

Requirements

More Skills You Might Like

Remotion Best Practices

Playwright Ci Caching

Tanstack Start

Land And Deploy

Kubernetes Manifest Generator

Review