SKT-OM

13B Agentic RAG System + 73+ Plugin Ecosystem

Think Mode - Plugin Architecture - LangGraph - SKT RAG - AMD Developer Cloud
ROCm 7.0 | vLLM FP16 | TIGER-OM 13B

73+ PLUGINS Agents Plugins Think Mode Supervisor Performance Security

Project Resources

HF
HuggingFace Space
Live demo app running on HF
->
BR
TIGER-OM Model
13B LLM trained on AMD Cloud
->
GH
GitHub Repository
Source code and app files
->
WEB
Live Demo
App running on Render
->
PKG
GGUF Quantized
Q4_K_M quantized model
->
TRO
Hackathon Page
AMD Developer Hackathon 2026
->
PDF
Presentation PDF
Full project presentation
->
73+
Total Plugins
12
Categories
6
Core Agents
97.3%
Accuracy
2.3s
Avg Response

How SKT-OM Works

01
🎯

User Query Input

User types any natural language query — simple question, complex multi-hop reasoning, or code-related task. No file uploads needed. Pure text-to-intelligent-text pipeline.

# Example queries SKT-OM handles query = "Compare ROCm 7.0 vs CUDA 12.4 for LLM training efficiency" query = "Explain transformer attention mechanism with code example" query = "What were AMD's key GPU releases in 2025-2026?"
02
🧠

Think Mode Activation

The query enters Think Mode — SKT-OM's reasoning engine. It analyzes intent, detects complexity, classifies domain, and decides whether to decompose into sub-queries or route directly to plugins/RAG.

# Think Mode: Query Analysis & Planning class ThinkMode: def analyze(self, query): intent = self.classify_intent(query) # research | code | math | compare complexity = self.assess_complexity(query) # simple | medium | complex plan = self.create_execution_plan(intent, complexity) return plan # {agents: [...], plugins: [...], mode: "parallel"}
03
🔌

Plugin Router

Based on Think Mode's plan, the Plugin Router dynamically loads required tools. Code execution? → Code Plugin. Math problem? → Math Plugin. Web data? → Search Plugin. Each plugin is a modular, hot-swappable component.

# Plugin Router: Dynamic Tool Loading router = PluginRouter(plugins_dir="plugins/", auto_discover=True) active_plugins = router.route(query_plan) # Returns: [CodePlugin(), MathPlugin(), SearchPlugin()] for plugin in active_plugins: result = plugin.execute(sub_query) evidence.append(result)
04
📚

SKT RAG Retrieval

Simultaneously, the SKT RAG Pipeline kicks in. Query gets rewritten for better retrieval, multi-hop search runs across vector stores, results are reranked and contextually compressed. This ensures only the most relevant knowledge reaches the LLM.

# SKT RAG: Advanced Retrieval Pipeline rag_pipeline = SKTRAG( vector_store="ChromaDB", embedding_model="BGE-large", llm="TIGER-OM-13B" ) rewritten_query = rag_pipeline.rewrite_query(query) docs = rag_pipeline.multi_hop_retrieve(rewritten_query, top_k=10) ranked_docs = rag_pipeline.rerank(docs) compressed = rag_pipeline.contextual_compress(ranked_docs)
05
🤖

LangGraph Agent Execution

LangGraph orchestrates the multi-agent workflow. Agents execute in parallel or sequential mode based on query complexity. Each agent has stateful memory, tool access, and self-correction loops. The Supervisor monitors and coordinates all agents.

# LangGraph: Multi-Agent Orchestration from langgraph import StateGraph workflow = StateGraph(AgentState) workflow.add_node("think", think_mode) workflow.add_node("retrieve", rag_retrieve) workflow.add_node("plugin", plugin_execute) workflow.add_node("synthesize", final_synthesis) workflow.add_edge("think", "retrieve") workflow.add_edge("think", "plugin") workflow.add_edge(["retrieve", "plugin"], "synthesize") app = workflow.compile() result = app.invoke({"query": user_query})
06
✅

Verification & Self-Correction

Before final output, SKT-OM runs verification loops. It checks consistency between plugin results and RAG data, validates calculations, and cross-references facts. If inconsistencies are found, it auto-corrects by re-querying or adjusting the synthesis.

# Verification Engine class Verifier: def verify(self, evidence, draft_response): consistency = self.check_consistency(evidence) facts = self.validate_facts(draft_response) if consistency < 0.85 or facts.failed > 0: return self.trigger_rethink(evidence) return draft_response # Confidence: 97.3%
07
📤

Final Synthesis & Response

All evidence — from plugins, RAG retrieval, and agent reasoning — gets synthesized by the 13B TIGER-OM model running on AMD MI300X via vLLM. The response is accurate, well-structured, cited where possible, and delivered with reasoning chains.

# Final Synthesis on AMD MI300X llm = vLLM( model="TIGER-OM-13B", tensor_parallel=2, gpu_memory_utilization=0.85 ) final_prompt = self.build_synthesis_prompt( query=user_query, rag_context=compressed_docs, plugin_results=plugin_evidence, reasoning_chain=agent_trace ) response = llm.generate(final_prompt, max_tokens=2048) # Output: Structured, cited, reasoned response

Complete Data Flow

🎯
User Query
→
🧠
Think Mode
→
🔌
Plugin Router
→
📚
SKT RAG
→
🤖
LangGraph
→
✅
Verify
→
📤
Response

Live Execution Trace

$ skt-om start --mode=agentic --think
[INIT] SKT-OM 13B Agentic RAG System v2.0
[INFO] Loading TIGER-OM-13B on AMD MI300X...
[OK] Model loaded | VRAM: 14.2GB/32GB | ROCm 7.0
[INFO] Initializing SKT RAG pipeline...
[OK] Vector store: ChromaDB | 47,832 documents indexed
[INFO] Loading plugin ecosystem...
[OK] 73 plugins active: Search, Code, Data, Cloud, AI/ML, Security...
[INFO] LangGraph workflow compiled | 4 nodes, 5 edges
[READY] SKT-OM is listening on port 7860
$ Waiting for query...

Plugin Ecosystem

Google ADK Style Architecture — 73+ Plugins across 12 Categories

ALL (73+)
Search (8)
Code (10)
Data (9)
Communication (7)
Database (8)
Cloud (6)
AI/ML (7)
Security (5)
Monitoring (6)
Automation (5)
Integration (4)
G
Google Search
Search
Custom Search API with result deduplication, snippet extraction, and relevance scoring.
API Web
D
DuckDuckGo
Search
Privacy-focused search with instant answers, news, and image search capabilities.
Privacy Fallback
B
Bing Web Search
Search
Microsoft Bing API with entity recognition, safe search, and market localization.
Microsoft Entity
N
News API
Search
Real-time news aggregation from 30,000+ sources with sentiment analysis and topic clustering.
News Real-time
G
Google Maps
Search
Location search, geocoding, directions, places API, and distance matrix calculations.
Maps Geo
W
Web Scraper
Search
Intelligent web scraping with adaptive CSS selectors, anti-bot bypass, and structured data extraction.
Scraper Data
W
Wikipedia
Search
Wikipedia API with page search, summary extraction, and multi-language support.
Knowledge Wiki
A
ArXiv Search
Search
Scientific paper search with abstract extraction, citation analysis, and PDF download.
Research Academic
P
Python Executor
Code
Sandboxed Python execution with pip install, output capture, and error traceback analysis.
Python Sandbox
J
JavaScript Runner
Code
Node.js execution with npm support, async handling, and console output streaming.
JS Node
J
Java Compiler
Code
Java compilation and execution with Maven/Gradle support and JUnit test running.
Java JVM
G
Go Runner
Code
Go code execution with module support, goroutine handling, and benchmark testing.
Go Golang
R
Rust Compiler
Code
Rust compilation with cargo support, memory safety checks, and performance profiling.
Rust Systems
P
PHP Executor
Code
PHP execution with composer support, framework detection, and MySQL integration.
PHP Web
C
Code Reviewer
Code
Automated code review with linting, security scanning, and style guide enforcement.
Review Lint
T
Test Runner
Code
Multi-language test execution with coverage reporting, flaky test detection, and CI integration.
Testing CI/CD
P
Package Manager
Code
Cross-language package management with dependency resolution, vulnerability scanning, and lock file generation.
Packages Deps
D
Docker Executor
Code
Containerized code execution with Dockerfile support, image caching, and resource limits.
Docker Container
P
Pandas Analyzer
Data
DataFrame operations, statistical analysis, missing value handling, and correlation matrices.
Pandas Stats
M
Matplotlib Viz
Data
Chart generation with line, bar, scatter, heatmap, and 3D visualization support.
Viz Charts
P
Plotly Interactive
Data
Interactive dashboards with zoom, pan, hover tooltips, and export to HTML/PNG.
Interactive Dashboard
N
NumPy Compute
Data
High-performance numerical computing with array operations, linear algebra, and FFT.
NumPy Math
S
SciPy Advanced
Data
Scientific computing with optimization, integration, interpolation, and signal processing.
SciPy Science
S
Scikit-Learn ML
Data
Machine learning with classification, regression, clustering, and model evaluation.
ML Sklearn
C
CSV Processor
Data
CSV parsing, transformation, validation, and export with schema inference and encoding detection.
CSV ETL
J
JSON Transformer
Data
JSON parsing, schema validation, transformation with JMESPath, and pretty printing.
JSON Transform
E
Excel Handler
Data
Excel read/write with formula support, pivot tables, and multi-sheet operations.
Excel Office
E
Email Sender
Communication
SMTP email sending with template support, attachment handling, and delivery tracking.
SMTP Email
S
Slack Messenger
Communication
Slack API integration with channel messaging, thread replies, and file uploads.
Slack Chat
T
Twitter/X API
Communication
Twitter API v2 with tweet posting, timeline reading, and engagement analytics.
Twitter Social
S
SMS Gateway
Communication
SMS sending via Twilio with delivery status, bulk messaging, and template variables.
SMS Twilio
D
Discord Bot
Communication
Discord bot integration with message sending, embeds, and slash command handling.
Discord Bot
V
Voice Call
Communication
Voice call initiation with text-to-speech, call recording, and transcription.
Voice TTS
T
Telegram Bot
Communication
Telegram Bot API with message sending, inline keyboards, and webhook handling.
Telegram Bot
P
PostgreSQL
Database
PostgreSQL connection with SQL execution, pgvector support, and connection pooling.
SQL Vector
M
MongoDB
Database
MongoDB operations with aggregation pipelines, schema validation, and change streams.
NoSQL Document
R
Redis Cache
Database
Redis operations with caching, pub/sub, streams, and vector similarity search.
Cache Redis
M
MySQL Connector
Database
MySQL connection with query execution, transaction support, and replication handling.
SQL MySQL
C
ChromaDB
Database
Vector database with embedding storage, semantic search, and metadata filtering.
Vector Embeddings
P
Pinecone
Database
Pinecone vector DB with hybrid search, metadata filtering, and namespace management.
Vector Cloud
B
BigQuery
Database
Google BigQuery integration with SQL queries, dataset management, and cost optimization.
Analytics Google
S
SQLite
Database
SQLite operations with in-memory databases, file-based storage, and FTS5 full-text search.
Lite Local
A
AWS S3
Cloud
S3 bucket operations with upload, download, presigned URLs, and lifecycle policies.
Storage AWS
A
Azure Blob
Cloud
Azure Blob Storage with container management, SAS tokens, and tiered storage.
Azure Blob
G
GCP Storage
Cloud
Google Cloud Storage with bucket operations, IAM management, and signed URLs.
GCP Storage
D
Docker Hub
Cloud
Docker image management with pull, push, tag operations, and registry search.
Docker Registry
K
Kubernetes
Cloud
K8s cluster operations with pod management, deployment scaling, and log streaming.
K8s DevOps
C
Cloud Functions
Cloud
Serverless function deployment with trigger configuration and execution monitoring.
Serverless FaaS
O
OpenAI GPT
AI/ML
OpenAI API integration with GPT-4, DALL-E, Whisper, and embedding models.
GPT-4 OpenAI
H
HuggingFace
AI/ML
HuggingFace integration with model inference, pipeline execution, and dataset loading.
HF Transformers
A
Anthropic Claude
AI/ML
Claude API with extended context, reasoning, and tool use capabilities.
Claude Anthropic
I
Image Generator
AI/ML
Image generation with Stable Diffusion, DALL-E, and Midjourney-style prompts.
Image GenAI
S
Speech Recognition
AI/ML
Speech-to-text with Whisper, real-time transcription, and speaker diarization.
ASR Audio
T
Text Summarizer
AI/ML
Abstractive and extractive summarization with length control and key point extraction.
NLP Summary
T
Translation
AI/ML
Multi-language translation with 100+ languages, context preservation, and domain adaptation.
NLP Translate
A
Auth Manager
Security
OAuth 2.0, JWT, and API key management with token rotation and scope enforcement.
Auth OAuth
R
Rate Limiter
Security
Token bucket rate limiting with Redis backend, per-user quotas, and burst handling.
Rate Limit
P
PII Detector
Security
PII detection and masking with regex patterns, NER models, and compliance reporting.
PII Privacy
V
Vulnerability Scan
Security
Dependency vulnerability scanning with CVE database, severity scoring, and fix suggestions.
CVE Scan
E
Encryption
Security
AES-256 encryption with key management, HSM integration, and secure key rotation.
AES Encrypt
P
Prometheus Metrics
Monitoring
Prometheus metrics export with custom counters, histograms, and gauge collection.
Metrics Prometheus
L
Log Aggregator
Monitoring
Centralized logging with structured JSON, log levels, and Elasticsearch forwarding.
Logs ELK
A
APM Tracer
Monitoring
Distributed tracing with OpenTelemetry, span collection, and Jaeger integration.
Trace OTel
A
Alert Manager
Monitoring
Alert routing with PagerDuty, Slack, and email notifications with escalation policies.
Alert PagerDuty
H
Health Check
Monitoring
System health monitoring with endpoint checks, dependency validation, and status pages.
Health Status
S
SLO Tracker
Monitoring
Service Level Objective tracking with error budget calculation and burn rate alerts.
SLO SRE
W
Workflow Engine
Automation
Visual workflow builder with conditional logic, loops, and parallel execution.
Workflow Builder
C
Cron Scheduler
Automation
Cron job scheduling with timezone support, retry logic, and job monitoring.
Cron Schedule
W
Webhook Handler
Automation
Webhook reception with signature verification, payload parsing, and event routing.
Webhook Events
C
CI/CD Pipeline
Automation
GitHub/GitLab CI integration with build triggers, test execution, and deployment.
CI/CD DevOps
T
Task Queue
Automation
Background task processing with priority queues, retries, and dead letter handling.
Queue Background
G
GitHub API
Integration
GitHub integration with repo management, PR reviews, issue tracking, and Actions triggers.
GitHub Git
G
GitLab API
Integration
GitLab integration with project management, CI pipelines, and merge request handling.
GitLab Git
J
Jira Connector
Integration
Jira integration with issue creation, sprint management, and workflow transitions.
Jira Project
N
Notion API
Integration
Notion integration with page creation, database queries, and block manipulation.
Notion Docs

PLUGIN ARCHITECTURE

# SKT-OM Plugin System - Google ADK Style from skt_om.plugins import PluginManager, BasePlugin # Initialize plugin manager with 73+ plugins manager = PluginManager( plugins_dir="plugins/", auto_discover=True, lazy_load=True ) # Think Mode auto-routes to appropriate plugins plan = think_mode.analyze(query) # Returns: {plugins: ["Search", "Code", "Data"], mode: "parallel"} # Dynamic plugin loading active_plugins = manager.load_plugins(plan.plugins) # Execute all plugins in parallel results = await manager.execute_parallel(active_plugins, query) # Register custom plugin class MyCustomPlugin(BasePlugin): name = "custom_tool" version = "1.0.0" async def execute(self, context): # Your custom logic here return result manager.register(MyCustomPlugin())