💬 コミュニケーションコミュニティ 🟢 非エンジニアでもOK 👤 管理職・人事・カスタマー対応

💬 エージェントDBPerformance最適化

AgentDB Performance Optimization

AgentDBのデータ処理を高速化し、メモリ

⚡ ⏱ 社内アナウンス文 30分 → 3分

📺 まず動画で見る(YouTube)

※ jpskill.com 編集部が参考用に選んだ動画です。動画の内容と Skill の挙動は厳密には一致しないことがあります。

📜 元の英語説明(参考)

Optimize AgentDB performance with quantization (4-32x memory reduction), HNSW indexing (150x faster search), caching, and batch operations. Use when optimizing memory usage, improving search speed, or scaling to millions of vectors.

🇯🇵 日本人クリエイター向け解説

一言でいうと

AgentDBのデータ処理を高速化し、メモリ

※ jpskill.com 編集部が日本のビジネス現場向けに補足した解説です。Skill本体の挙動とは独立した参考情報です。

⚡ おすすめ: コマンド1行でインストール(60秒)

下記のコマンドをコピーしてターミナル(Mac/Linux)または PowerShell(Windows)に貼り付けてください。ダウンロード → 解凍 → 配置まで全自動。

🍎 Mac / 🐧 Linux

mkdir -p ~/.claude/skills && cd ~/.claude/skills && curl -L -o agentdb-performance-optimization.zip https://jpskill.com/download/2115.zip && unzip -o agentdb-performance-optimization.zip && rm agentdb-performance-optimization.zip

🪟 Windows (PowerShell)

$d = "$env:USERPROFILE\.claude\skills"; ni -Force -ItemType Directory $d | Out-Null; iwr https://jpskill.com/download/2115.zip -OutFile "$d\agentdb-performance-optimization.zip"; Expand-Archive "$d\agentdb-performance-optimization.zip" -DestinationPath $d -Force; ri "$d\agentdb-performance-optimization.zip"

完了後、Claude Code を再起動 → 普通に「動画プロンプト作って」のように話しかけるだけで自動発動します。

💾 手動でダウンロードしたい(コマンドが難しい人向け)

1. 下の青いボタンを押して agentdb-performance-optimization.zip をダウンロード
2. ZIPファイルをダブルクリックで解凍 → agentdb-performance-optimization フォルダができる
3. そのフォルダを C:\Users\あなたの名前\.claude\skills\(Win)または ~/.claude/skills/(Mac)へ移動
4. Claude Code を再起動

⬇ .zip でダウンロード(推奨) ⬇ .skill 形式(上級者用) 元のソース ↗

⚠️ ダウンロード・利用は自己責任でお願いします。当サイトは内容・動作・安全性について責任を負いません。

🎯 このSkillでできること

下記の説明文を読むと、このSkillがあなたに何をしてくれるかが分かります。Claudeにこの分野の依頼をすると、自動で発動します。

📦 インストール方法 (3ステップ)

1. 上の「ダウンロード」ボタンを押して .skill ファイルを取得
2. ファイル名の拡張子を .skill から .zip に変えて展開(macは自動展開可)
3. 展開してできたフォルダを、ホームフォルダの .claude/skills/ に置く
- · macOS / Linux: ~/.claude/skills/
- · Windows: %USERPROFILE%\.claude\skills\

Claude Code を再起動すれば完了。「このSkillを使って…」と話しかけなくても、関連する依頼で自動的に呼び出されます。

詳しい使い方ガイドを見る →

最終更新: 2026-05-17
取得日時: 2026-05-17
同梱ファイル: 1

💬 こう話しかけるだけ — サンプルプロンプト

› AgentDB Performance Optimizati で、お客様への返信文を作って
› AgentDB Performance Optimizati を使って、社内向けアナウンスを書いて
› AgentDB Performance Optimizati で、メールテンプレートを整備して

これをClaude Code に貼るだけで、このSkillが自動発動します。

📖 Skill本文(日本語訳)

※ 原文(英語/中国語)を Gemini で日本語化したものです。Claude 自身は原文を読みます。誤訳がある場合は原文をご確認ください。

[Skill 名] AgentDB パフォーマンス最適化

AgentDB パフォーマンス最適化

このスキルがすること

AgentDB ベクトルデータベース向けの包括的なパフォーマンス最適化手法を提供します。量子化、HNSW インデックス作成、キャッシング戦略、およびバッチ操作を通じて、150倍から12,500倍のパフォーマンス向上を実現します。精度を維持しながら、メモリ使用量を4倍から32倍削減します。

パフォーマンス: ベクトル検索は100µs未満、パターン検索は1ms未満、100ベクトルの一括挿入は2msです。

前提条件

Node.js 18+
AgentDB v1.0.7+ (agentic-flow 経由)
既存の AgentDB データベースまたはアプリケーション

クイックスタート

パフォーマンスベンチマークの実行

# 包括的なパフォーマンスベンチマーク
npx agentdb@latest benchmark

# 結果は以下を示します:
# ✅ パターン検索: 150倍高速 (100µs vs 15ms)
# ✅ バッチ挿入: 500倍高速 (100ベクトルで2ms vs 1s)
# ✅ 大規模クエリ: 12,500倍高速 (100万ベクトルで8ms vs 100s)
# ✅ メモリ効率: 量子化により4-32倍削減

最適化の有効化

import { createAgentDBAdapter } from 'agentic-flow$reasoningbank';

// 最適化された設定
const adapter = await createAgentDBAdapter({
  dbPath: '.agentdb$optimized.db',
  quantizationType: 'binary',   // 32倍のメモリ削減
  cacheSize: 1000,               // インメモリキャッシュ
  enableLearning: true,
  enableReasoning: true,
});

量子化戦略

1. バイナリ量子化 (32倍削減)

最適: 大規模デプロイメント (100万+ベクトル)、メモリ制約のある環境 トレードオフ: 精度損失約2-5%、メモリ32倍削減、10倍高速

const adapter = await createAgentDBAdapter({
  quantizationType: 'binary',
  // 768次元 float32 (3072バイト) → 96バイト バイナリ
  // 100万ベクトル: 3GB → 96MB
});

ユースケース:

モバイル$エッジデプロイメント
大規模ベクトルストレージ (数百万のベクトル)
メモリ制約のあるリアルタイム検索

パフォーマンス:

メモリ: 32倍小さい
検索速度: 10倍高速 (ビット演算)
精度: オリジナルの95-98%

2. スカラー量子化 (4倍削減)

最適: パフォーマンス$精度のバランス、中規模データセット トレードオフ: 精度損失約1-2%、メモリ4倍削減、3倍高速

const adapter = await createAgentDBAdapter({
  quantizationType: 'scalar',
  // 768次元 float32 (3072バイト) → 768バイト (uint8)
  // 100万ベクトル: 3GB → 768MB
});

ユースケース:

高精度を必要とする本番アプリケーション
中規模デプロイメント (1万-100万ベクトル)
汎用的な最適化

パフォーマンス:

メモリ: 4倍小さい
検索速度: 3倍高速
精度: オリジナルの98-99%

3. プロダクト量子化 (8-16倍削減)

最適: 高次元ベクトル、バランスの取れた圧縮 トレードオフ: 精度損失約3-7%、メモリ8-16倍削減、5倍高速

const adapter = await createAgentDBAdapter({
  quantizationType: 'product',
  // 768次元 float32 (3072バイト) → 48-96バイト
  // 100万ベクトル: 3GB → 192MB
});

ユースケース:

高次元埋め込み (>512次元)
画像$動画埋め込み
大規模類似性検索

パフォーマンス:

メモリ: 8-16倍小さい
検索速度: 5倍高速
精度: オリジナルの93-97%

4. 量子化なし (フル精度)

最適: 最大限の精度、小規模データセット トレードオフ: 精度損失なし、フルメモリ使用

const adapter = await createAgentDBAdapter({
  quantizationType: 'none',
  // フル float32 精度
});

HNSW インデックス作成

Hierarchical Navigable Small World - O(log n) の検索複雑度

自動 HNSW

AgentDB は HNSW インデックスを自動的に構築します。

const adapter = await createAgentDBAdapter({
  dbPath: '.agentdb$vectors.db',
  // HNSW は自動的に有効になります
});

// HNSW を使用した検索 (線形スキャン15msに対し100µs)
const results = await adapter.retrieveWithReasoning(queryEmbedding, {
  k: 10,
});

HNSW パラメータ

// 高度な HNSW 設定
const adapter = await createAgentDBAdapter({
  dbPath: '.agentdb$vectors.db',
  hnswM: 16,              // レイヤーあたりの接続数 (デフォルト: 16)
  hnswEfConstruction: 200, // 構築品質 (デフォルト: 200)
  hnswEfSearch: 100,       // 検索品質 (デフォルト: 100)
});

パラメータチューニング:

M (接続数): 高いほどリコールが向上し、メモリ使用量が増加します。
- 小規模データセット (<1万): M = 8
- 中規模データセット (1万-10万): M = 16
- 大規模データセット (>10万): M = 32
efConstruction: 高いほどインデックス品質が向上し、構築が遅くなります。
- 高速構築: 100
- バランス: 200 (デフォルト)
- 高品質: 400
efSearch: 高いほどリコールが向上し、検索が遅くなります。
- 高速検索: 50
- バランス: 100 (デフォルト)
- 高リコール: 200

キャッシング戦略

インメモリパターンキャッシュ

const adapter = await createAgentDBAdapter({
  cacheSize: 1000,  // 最も使用頻度の高いパターン1000個をキャッシュ
});

// 初回検索: 約2ms (データベース)
// 以降: 1ms未満 (キャッシュヒット)
const result = await adapter.retrieveWithReasoning(queryEmbedding, {
  k: 10,
});

キャッシュチューニング:

小規模アプリケーション: 100-500パターン
中規模アプリケーション: 500-2000パターン
大規模アプリケーション: 2000-5000パターン

LRU キャッシュの動作

// キャッシュは最も使用頻度の低いパターンを自動的に削除します
// 最も頻繁にアクセスされるパターンはキャッシュに残ります

// キャッシュパフォーマンスの監視
const stats = await adapter.getStats();
console.log('Cache Hit Rate:', stats.cacheHitRate);
// 80%以上のヒット率を目指します

バッチ操作

バッチ挿入 (500倍高速)

// ❌ 遅い: 個別挿入
for (const doc of documents) {
  await adapter.insertPattern({ /* ... */ });  // 100ドキュメントで1秒
}

// ✅ 速い: バッチ挿入
const patterns = documents.map(doc => ({
  id: '',
  type: 'document',
  domain: 'knowledge',
  pattern_data: JSON.stringify({
    embedding: doc.embedding,
    text: doc.text,
  }),
  confidence: 1.0,
  usage_count: 0,
  success_count: 0,
  created_at: Date.now(),
  last_used: Date.now(),
}));

// 一度にすべて挿入 (100ドキュメントで2ms)
for (const pattern of patterns) {
  await adapter.insertPattern(patte

📜 原文 SKILL.md(Claudeが読む英語/中国語)を展開

AgentDB Performance Optimization

What This Skill Does

Provides comprehensive performance optimization techniques for AgentDB vector databases. Achieve 150x-12,500x performance improvements through quantization, HNSW indexing, caching strategies, and batch operations. Reduce memory usage by 4-32x while maintaining accuracy.

Performance: <100µs vector search, <1ms pattern retrieval, 2ms batch insert for 100 vectors.

Prerequisites

Node.js 18+
AgentDB v1.0.7+ (via agentic-flow)
Existing AgentDB database or application

Quick Start

Run Performance Benchmarks

# Comprehensive performance benchmarking
npx agentdb@latest benchmark

# Results show:
# ✅ Pattern Search: 150x faster (100µs vs 15ms)
# ✅ Batch Insert: 500x faster (2ms vs 1s for 100 vectors)
# ✅ Large-scale Query: 12,500x faster (8ms vs 100s at 1M vectors)
# ✅ Memory Efficiency: 4-32x reduction with quantization

Enable Optimizations

import { createAgentDBAdapter } from 'agentic-flow$reasoningbank';

// Optimized configuration
const adapter = await createAgentDBAdapter({
  dbPath: '.agentdb$optimized.db',
  quantizationType: 'binary',   // 32x memory reduction
  cacheSize: 1000,               // In-memory cache
  enableLearning: true,
  enableReasoning: true,
});

Quantization Strategies

1. Binary Quantization (32x Reduction)

Best For: Large-scale deployments (1M+ vectors), memory-constrained environments Trade-off: ~2-5% accuracy loss, 32x memory reduction, 10x faster

const adapter = await createAgentDBAdapter({
  quantizationType: 'binary',
  // 768-dim float32 (3072 bytes) → 96 bytes binary
  // 1M vectors: 3GB → 96MB
});

Use Cases:

Mobile$edge deployment
Large-scale vector storage (millions of vectors)
Real-time search with memory constraints

Performance:

Memory: 32x smaller
Search Speed: 10x faster (bit operations)
Accuracy: 95-98% of original

2. Scalar Quantization (4x Reduction)

Best For: Balanced performance$accuracy, moderate datasets Trade-off: ~1-2% accuracy loss, 4x memory reduction, 3x faster

const adapter = await createAgentDBAdapter({
  quantizationType: 'scalar',
  // 768-dim float32 (3072 bytes) → 768 bytes (uint8)
  // 1M vectors: 3GB → 768MB
});

Use Cases:

Production applications requiring high accuracy
Medium-scale deployments (10K-1M vectors)
General-purpose optimization

Performance:

Memory: 4x smaller
Search Speed: 3x faster
Accuracy: 98-99% of original

3. Product Quantization (8-16x Reduction)

Best For: High-dimensional vectors, balanced compression Trade-off: ~3-7% accuracy loss, 8-16x memory reduction, 5x faster

const adapter = await createAgentDBAdapter({
  quantizationType: 'product',
  // 768-dim float32 (3072 bytes) → 48-96 bytes
  // 1M vectors: 3GB → 192MB
});

Use Cases:

High-dimensional embeddings (>512 dims)
Image$video embeddings
Large-scale similarity search

Performance:

Memory: 8-16x smaller
Search Speed: 5x faster
Accuracy: 93-97% of original

4. No Quantization (Full Precision)

Best For: Maximum accuracy, small datasets Trade-off: No accuracy loss, full memory usage

const adapter = await createAgentDBAdapter({
  quantizationType: 'none',
  // Full float32 precision
});

HNSW Indexing

Hierarchical Navigable Small World - O(log n) search complexity

Automatic HNSW

AgentDB automatically builds HNSW indices:

const adapter = await createAgentDBAdapter({
  dbPath: '.agentdb$vectors.db',
  // HNSW automatically enabled
});

// Search with HNSW (100µs vs 15ms linear scan)
const results = await adapter.retrieveWithReasoning(queryEmbedding, {
  k: 10,
});

HNSW Parameters

// Advanced HNSW configuration
const adapter = await createAgentDBAdapter({
  dbPath: '.agentdb$vectors.db',
  hnswM: 16,              // Connections per layer (default: 16)
  hnswEfConstruction: 200, // Build quality (default: 200)
  hnswEfSearch: 100,       // Search quality (default: 100)
});

Parameter Tuning:

M (connections): Higher = better recall, more memory
- Small datasets (<10K): M = 8
- Medium datasets (10K-100K): M = 16
- Large datasets (>100K): M = 32
efConstruction: Higher = better index quality, slower build
- Fast build: 100
- Balanced: 200 (default)
- High quality: 400
efSearch: Higher = better recall, slower search
- Fast search: 50
- Balanced: 100 (default)
- High recall: 200

Caching Strategies

In-Memory Pattern Cache

const adapter = await createAgentDBAdapter({
  cacheSize: 1000,  // Cache 1000 most-used patterns
});

// First retrieval: ~2ms (database)
// Subsequent: <1ms (cache hit)
const result = await adapter.retrieveWithReasoning(queryEmbedding, {
  k: 10,
});

Cache Tuning:

Small applications: 100-500 patterns
Medium applications: 500-2000 patterns
Large applications: 2000-5000 patterns

LRU Cache Behavior

// Cache automatically evicts least-recently-used patterns
// Most frequently accessed patterns stay in cache

// Monitor cache performance
const stats = await adapter.getStats();
console.log('Cache Hit Rate:', stats.cacheHitRate);
// Aim for >80% hit rate

Batch Operations

Batch Insert (500x Faster)

// ❌ SLOW: Individual inserts
for (const doc of documents) {
  await adapter.insertPattern({ /* ... */ });  // 1s for 100 docs
}

// ✅ FAST: Batch insert
const patterns = documents.map(doc => ({
  id: '',
  type: 'document',
  domain: 'knowledge',
  pattern_data: JSON.stringify({
    embedding: doc.embedding,
    text: doc.text,
  }),
  confidence: 1.0,
  usage_count: 0,
  success_count: 0,
  created_at: Date.now(),
  last_used: Date.now(),
}));

// Insert all at once (2ms for 100 docs)
for (const pattern of patterns) {
  await adapter.insertPattern(pattern);
}

Batch Retrieval

// Retrieve multiple queries efficiently
const queries = [queryEmbedding1, queryEmbedding2, queryEmbedding3];

// Parallel retrieval
const results = await Promise.all(
  queries.map(q => adapter.retrieveWithReasoning(q, { k: 5 }))
);

Memory Optimization

Automatic Consolidation

// Enable automatic pattern consolidation
const result = await adapter.retrieveWithReasoning(queryEmbedding, {
  domain: 'documents',
  optimizeMemory: true,  // Consolidate similar patterns
  k: 10,
});

console.log('Optimizations:', result.optimizations);
// {
//   consolidated: 15,  // Merged 15 similar patterns
//   pruned: 3,         // Removed 3 low-quality patterns
//   improved_quality: 0.12  // 12% quality improvement
// }

Manual Optimization

// Manually trigger optimization
await adapter.optimize();

// Get statistics
const stats = await adapter.getStats();
console.log('Before:', stats.totalPatterns);
console.log('After:', stats.totalPatterns);  // Reduced by ~10-30%

Pruning Strategies

// Prune low-confidence patterns
await adapter.prune({
  minConfidence: 0.5,     // Remove confidence < 0.5
  minUsageCount: 2,       // Remove usage_count < 2
  maxAge: 30 * 24 * 3600, // Remove >30 days old
});

Performance Monitoring

Database Statistics

# Get comprehensive stats
npx agentdb@latest stats .agentdb$vectors.db

# Output:
# Total Patterns: 125,430
# Database Size: 47.2 MB (with binary quantization)
# Avg Confidence: 0.87
# Domains: 15
# Cache Hit Rate: 84%
# Index Type: HNSW

Runtime Metrics

const stats = await adapter.getStats();

console.log('Performance Metrics:');
console.log('Total Patterns:', stats.totalPatterns);
console.log('Database Size:', stats.dbSize);
console.log('Avg Confidence:', stats.avgConfidence);
console.log('Cache Hit Rate:', stats.cacheHitRate);
console.log('Search Latency (avg):', stats.avgSearchLatency);
console.log('Insert Latency (avg):', stats.avgInsertLatency);

Optimization Recipes

Recipe 1: Maximum Speed (Sacrifice Accuracy)

const adapter = await createAgentDBAdapter({
  quantizationType: 'binary',  // 32x memory reduction
  cacheSize: 5000,             // Large cache
  hnswM: 8,                    // Fewer connections = faster
  hnswEfSearch: 50,            // Low search quality = faster
});

// Expected: <50µs search, 90-95% accuracy

Recipe 2: Balanced Performance

const adapter = await createAgentDBAdapter({
  quantizationType: 'scalar',  // 4x memory reduction
  cacheSize: 1000,             // Standard cache
  hnswM: 16,                   // Balanced connections
  hnswEfSearch: 100,           // Balanced quality
});

// Expected: <100µs search, 98-99% accuracy

Recipe 3: Maximum Accuracy

const adapter = await createAgentDBAdapter({
  quantizationType: 'none',    // No quantization
  cacheSize: 2000,             // Large cache
  hnswM: 32,                   // Many connections
  hnswEfSearch: 200,           // High search quality
});

// Expected: <200µs search, 100% accuracy

Recipe 4: Memory-Constrained (Mobile/Edge)

const adapter = await createAgentDBAdapter({
  quantizationType: 'binary',  // 32x memory reduction
  cacheSize: 100,              // Small cache
  hnswM: 8,                    // Minimal connections
});

// Expected: <100µs search, ~10MB for 100K vectors

Scaling Strategies

Small Scale (<10K vectors)

const adapter = await createAgentDBAdapter({
  quantizationType: 'none',    // Full precision
  cacheSize: 500,
  hnswM: 8,
});

Medium Scale (10K-100K vectors)

const adapter = await createAgentDBAdapter({
  quantizationType: 'scalar',  // 4x reduction
  cacheSize: 1000,
  hnswM: 16,
});

Large Scale (100K-1M vectors)

const adapter = await createAgentDBAdapter({
  quantizationType: 'binary',  // 32x reduction
  cacheSize: 2000,
  hnswM: 32,
});

Massive Scale (>1M vectors)

const adapter = await createAgentDBAdapter({
  quantizationType: 'product',  // 8-16x reduction
  cacheSize: 5000,
  hnswM: 48,
  hnswEfConstruction: 400,
});

Troubleshooting

Issue: High memory usage

# Check database size
npx agentdb@latest stats .agentdb$vectors.db

# Enable quantization
# Use 'binary' for 32x reduction

Issue: Slow search performance

// Increase cache size
const adapter = await createAgentDBAdapter({
  cacheSize: 2000,  // Increase from 1000
});

// Reduce search quality (faster)
const result = await adapter.retrieveWithReasoning(queryEmbedding, {
  k: 5,  // Reduce from 10
});

Issue: Low accuracy

// Disable or use lighter quantization
const adapter = await createAgentDBAdapter({
  quantizationType: 'scalar',  // Instead of 'binary'
  hnswEfSearch: 200,           // Higher search quality
});

Performance Benchmarks

Test System: AMD Ryzen 9 5950X, 64GB RAM

Operation	Vector Count	No Optimization	Optimized	Improvement
Search	10K	15ms	100µs	150x
Search	100K	150ms	120µs	1,250x
Search	1M	100s	8ms	12,500x
Batch Insert (100)	-	1s	2ms	500x
Memory Usage	1M	3GB	96MB	32x (binary)

Learn More

Quantization Paper: docs$quantization-techniques.pdf
HNSW Algorithm: docs$hnsw-index.pdf
GitHub: https:/$github.com$ruvnet$agentic-flow$tree$main$packages$agentdb
Website: https:/$agentdb.ruv.io

Category: Performance / Optimization Difficulty: Intermediate Estimated Time: 20-30 minutes