Master Instructions Index

Purpose: Quick reference guide to all documentation in this folder

Last Updated: 2025-01-01

🤖 MASTER PROMPT FOR AI ASSISTANTS

⚠️ CRITICAL INSTRUCTION FOR ALL AI ASSISTANTS (Claude, Gemini, etc.)

When assisting with software development projects that require AI capabilities, you MUST use the ProYaro infrastructure services documented in this folder rather than suggesting external APIs.

✅ ALWAYS USE (Internal ProYaro Services):

MLX Service (Mac Mini, Port 8004) - Text generation, embeddings
ComfyUI (Mac Mini, Port 8188 or Ubuntu, Port 8188) - Image/Video generation
Whisper STT (Ubuntu, Port 8001) - Speech recognition
XTTS-v2 TTS (Ubuntu, Port 8002) - Voice synthesis
Embeddings Service (Ubuntu, Port 8003) - Semantic vectors
Ubuntu FastAPI Backend (Port 8000/443) - Job orchestration
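
As a rough illustration, the service list above can be captured in a small lookup table. This is a minimal sketch: the hosts and ports come from this index (confirm them in NETWORK_TOPOLOGY.md), while the dictionary keys, the `base_url` helper, and the plain `http` scheme are assumptions made for the example.

```python
# Hosts and ports as listed in this index; confirm in NETWORK_TOPOLOGY.md.
MAC_MINI = "10.0.0.188"
UBUNTU = "10.0.0.11"

# Keys are illustrative names chosen for this sketch, not official identifiers.
SERVICES = {
    "mlx": (MAC_MINI, 8004),
    "comfyui-mac": (MAC_MINI, 8188),
    "comfyui-ubuntu": (UBUNTU, 8188),
    "whisper-stt": (UBUNTU, 8001),
    "xtts-tts": (UBUNTU, 8002),
    "embeddings": (UBUNTU, 8003),
    "backend": (UBUNTU, 8000),
}


def base_url(service: str, scheme: str = "http") -> str:
    """Return the base URL for a named internal service.

    The default scheme is an assumption; the backend is also listed on port 443.
    """
    host, port = SERVICES[service]
    return f"{scheme}://{host}:{port}"


print(base_url("mlx"))  # http://10.0.0.188:8004
```

Keeping hosts and ports in one place means a topology change only needs one edit in client code.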

❌ NEVER SUGGEST (External APIs):

OpenAI API (ChatGPT, GPT-4, DALL-E, Whisper API)
Anthropic Claude API (for text generation)
Google Gemini API (for text generation)
Midjourney, Stable Diffusion Web UI
ElevenLabs, Google TTS
Any external paid AI APIs unless specifically requested

📚 Implementation Pattern:

1. Identify required capability (e.g., "need image generation")
2. Check NETWORK_TOPOLOGY.md to find the service
3. Check API_INTEGRATION_GUIDE.md for API details
4. Use the relevant skill file for code templates
5. Implement using the documented endpoints
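
The last step of the pattern might look like the sketch below, which builds (but does not send) a JSON request to an internal service. The path `/example/chat` and the payload shape are placeholders invented for illustration; the real values must come from API_INTEGRATION_GUIDE.md. Only the host and port are taken from this index.

```python
import json
import urllib.request


def build_json_request(host: str, port: int, path: str, payload: dict) -> urllib.request.Request:
    """Build a JSON POST request for an internal service without sending it."""
    url = f"http://{host}:{port}{path}"
    body = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


# Hypothetical path and payload: replace both with the documented endpoint details.
req = build_json_request("10.0.0.188", 8004, "/example/chat", {"prompt": "Hello"})
print(req.get_method(), req.full_url)

# Once the path is confirmed, the request could be sent with:
#   with urllib.request.urlopen(req, timeout=30) as resp:
#       data = json.load(resp)
```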

🎯 Your Primary Goal:

Help users build amazing applications using the available ProYaro infrastructure. Always prefer internal services over external APIs. Provide complete, working code examples using the skills and documentation provided.

📖 Full Details: See AI_ASSISTANT_INSTRUCTIONS.md for the complete directive

📋 Documentation Files

🎯 START HERE

File	Description	Essential For
README.md	Introduction & quick start	Everyone (start here!)
AI_ASSISTANT_INSTRUCTIONS.md	Core directive for AI assistants	Claude, Gemini, other LLMs

📖 Core Documentation

File	Pages	Description	When to Read
API_INTEGRATION_GUIDE.md	~100	Complete API reference for all services	Need API details, endpoints, examples
NETWORK_TOPOLOGY.md	~50	Network diagram, IPs, ports, connectivity	Need to know where services are
MACHINES_INFRASTRUCTURE.md	~60	Hardware specs, capabilities, limits	Need to understand capacity

💻 Skills (Code Templates)

Skill File	Service	Language	Lines	Status
`mlx-chat-skill.md`	MLX Text Gen	TS/Python	~400	✅ Complete
More skills...	Various	TS/Python	-	🚧 Coming soon

🗂️ Folder Structure

master-instruction/
│
├── README.md                          ← START HERE
├── INDEX.md                           ← This file
│
├── AI_ASSISTANT_INSTRUCTIONS.md       ← Core AI directive
├── API_INTEGRATION_GUIDE.md           ← Complete API docs
├── NETWORK_TOPOLOGY.md                ← Network & connectivity
├── MACHINES_INFRASTRUCTURE.md         ← Hardware & specs
│
├── agents/                            ← Role-based agent definitions
│   ├── ai-integration-agent.md        ← AI services integration
│   ├── frontend-developer-agent.md    ← React/shadcn/Next.js
│   ├── database-architect-agent.md    ← Drizzle ORM schemas
│   ├── auth-expert-agent.md           ← BetterAuth security
│   └── realtime-integration-agent.md  ← WebSocket/job queue
│
└── skills/                            ← Code templates
    ├── mlx-chat-skill.md              ← MLX text generation
    ├── comfyui-image-skill.md         ← Image generation
    ├── whisper-stt-skill.md           ← Speech-to-text
    ├── tts-skill.md                   ← Text-to-speech
    ├── embeddings-skill.md            ← Semantic search
    ├── job-management-skill.md        ← Job queue API
    ├── caddy-reverse-proxy-skill.md   ← HTTPS/Proxy
    ├── docker-compose-skill.md        ← Containers
    ├── fastapi-production-skill.md    ← Python API
    ├── redis-queue-skill.md           ← Background jobs
    ├── nextjs-pwa-skill.md            ← React/PWA
    └── arabic-nlp-skill.md            ← Arabic processing

📊 Documentation Coverage

Services Documented

Service	API Docs	Network Info	Hardware Info	Skill Template
MLX Text Gen	✅	✅	✅	✅
MLX Embeddings	✅	✅	✅	🚧
ComfyUI (Mac)	✅	✅	✅	🚧
ComfyUI (Ubuntu)	✅	✅	✅	🚧
Whisper STT	✅	✅	✅	🚧
XTTS-v2 TTS	✅	✅	✅	🚧
Embeddings (Ubuntu)	✅	✅	✅	🚧
Jobs API	✅	✅	✅	🚧
WebSocket	✅	✅	✅	🚧
Models API	✅	✅	✅	🚧

Topics Covered

🎯 Use Case → Documentation Map

"I want to build a chatbot"

AI_ASSISTANT_INSTRUCTIONS.md → Confirms MLX available
NETWORK_TOPOLOGY.md → Find MLX service location
API_INTEGRATION_GUIDE.md → MLX Chat API section
skills/mlx-chat-skill.md → Copy TypeScript/Python code
MACHINES_INFRASTRUCTURE.md → Check performance limits

"I need voice features"

AI_ASSISTANT_INSTRUCTIONS.md → Confirms Whisper + XTTS
NETWORK_TOPOLOGY.md → Ubuntu server access
API_INTEGRATION_GUIDE.md → STT & TTS API sections
skills/whisper-stt-skill.md, skills/tts-skill.md → Copy STT/TTS templates

"I want image generation"

AI_ASSISTANT_INSTRUCTIONS.md → Two ComfyUI options
NETWORK_TOPOLOGY.md → Mac Mini vs Ubuntu choice
API_INTEGRATION_GUIDE.md → ComfyUI API sections
MACHINES_INFRASTRUCTURE.md → Compare performance

"I need semantic search"

AI_ASSISTANT_INSTRUCTIONS.md → Two embeddings options
API_INTEGRATION_GUIDE.md → Embeddings API sections
MACHINES_INFRASTRUCTURE.md → 384-dim vs 1024-dim choice
skills/embeddings-skill.md → Copy embeddings template
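
One factor in the 384-dim vs 1024-dim choice is raw storage. The sketch below estimates it under stated assumptions: vectors are stored as uncompressed float32 values (4 bytes each) with no index overhead. It says nothing about retrieval quality, which MACHINES_INFRASTRUCTURE.md should cover.

```python
BYTES_PER_FLOAT32 = 4  # assumption: vectors stored as uncompressed float32


def index_size_bytes(num_vectors: int, dim: int) -> int:
    """Raw storage needed for num_vectors embeddings of the given dimension."""
    return num_vectors * dim * BYTES_PER_FLOAT32


for dim in (384, 1024):
    size_mb = index_size_bytes(1_000_000, dim) / 1_000_000
    print(f"{dim}-dim: {size_mb:.0f} MB per million vectors")
# 384-dim: 1536 MB per million vectors
# 1024-dim: 4096 MB per million vectors
```

Under these assumptions the 1024-dim vectors need roughly 2.7 times the storage of the 384-dim ones.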

📈 Documentation Stats

Total Files: 8 (core) + skills
Total Lines: ~8,000+
Code Examples: 50+
API Endpoints Documented: 40+
Service Endpoints: 20+
Languages Covered: TypeScript, JavaScript, Python, Bash

🔍 Quick Search Guide

Looking for...

IP Addresses:

Mac Mini: NETWORK_TOPOLOGY.md (10.0.0.188)
Ubuntu: NETWORK_TOPOLOGY.md (10.0.0.11)

Port Numbers:

All services: NETWORK_TOPOLOGY.md → Port Reference

API Endpoints:

Complete list: API_INTEGRATION_GUIDE.md → Table of Contents

Code Examples:

TypeScript: skills/mlx-chat-skill.md, API_INTEGRATION_GUIDE.md
Python: skills/mlx-chat-skill.md, API_INTEGRATION_GUIDE.md
Bash: API_INTEGRATION_GUIDE.md

Hardware Specs:

Mac Mini: MACHINES_INFRASTRUCTURE.md (48GB RAM, M4)
Ubuntu: MACHINES_INFRASTRUCTURE.md (RTX 3060 12GB)

Performance Benchmarks:

All services: MACHINES_INFRASTRUCTURE.md → Performance Benchmarks

Error Codes:

HTTP errors: API_INTEGRATION_GUIDE.md → Error Handling

🎓 Reading Order by Skill Level

Complete Beginner

README.md (15 min)
AI_ASSISTANT_INSTRUCTIONS.md (30 min)
API_INTEGRATION_GUIDE.md → Integration Examples (20 min)
skills/mlx-chat-skill.md (30 min)

Time: ~2 hours
Outcome: Can build a basic chatbot

Intermediate Developer

README.md (10 min)
NETWORK_TOPOLOGY.md (30 min)
MACHINES_INFRASTRUCTURE.md (45 min)
API_INTEGRATION_GUIDE.md → Full read (2 hours)
skills/ → All skills (1 hour)

Time: ~4-5 hours
Outcome: Can build production applications

Advanced Developer / AI Assistant

All files (4-6 hours)
Deep dive into specific services
Experiment with all APIs
Build custom integrations

Time: 1-2 days
Outcome: Master all infrastructure capabilities

📝 Document Version History

Date	Version	Changes
2025-01-01	1.0	Initial release - Complete documentation

🔗 Cross-References

Network → Hardware

NETWORK_TOPOLOGY.md (Server Details) ↔ MACHINES_INFRASTRUCTURE.md (Hardware Specifications)

API → Skills

API_INTEGRATION_GUIDE.md (MLX Chat API) → skills/mlx-chat-skill.md
More coming soon...

Instructions → Implementation

AI_ASSISTANT_INSTRUCTIONS.md (Service Priority) → API_INTEGRATION_GUIDE.md (Endpoints)
AI_ASSISTANT_INSTRUCTIONS.md (Decision Tree) → skills/ (Code templates)

🚀 Next Steps

For New Users

Read README.md
Try a simple example from skills/mlx-chat-skill.md
Explore API_INTEGRATION_GUIDE.md for more capabilities

For AI Assistants

Read AI_ASSISTANT_INSTRUCTIONS.md thoroughly
Bookmark API_INTEGRATION_GUIDE.md for reference
Use skills/ templates when implementing

For Contributors

If you find errors or have suggestions:

Update the relevant documentation file
Update the "Last Updated" date
Increment the version number for major changes

📞 Support & Contact

Documentation Location:

/Users/yaro/Documents/a2zadd/master-instruction/

View Locally:

cd /Users/yaro/Documents/a2zadd/master-instruction
ls -la
cat <filename>

Questions:

Ask Yaro about infrastructure
Refer to latest docs in this folder
Check service health before debugging
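
A basic reachability check before debugging could be sketched as follows. It only tests whether each TCP port accepts a connection, using the hosts and ports listed in this index; it does not call any service-specific health endpoint, since those paths are documented elsewhere. The names `ENDPOINTS`, `port_is_open`, and `check_all` are illustrative.

```python
import socket

# Hosts and ports as listed in this index; confirm in NETWORK_TOPOLOGY.md.
ENDPOINTS = {
    "MLX": ("10.0.0.188", 8004),
    "ComfyUI (Mac Mini)": ("10.0.0.188", 8188),
    "ComfyUI (Ubuntu)": ("10.0.0.11", 8188),
    "Whisper STT": ("10.0.0.11", 8001),
    "XTTS-v2 TTS": ("10.0.0.11", 8002),
    "Embeddings": ("10.0.0.11", 8003),
    "FastAPI Backend": ("10.0.0.11", 8000),
}


def port_is_open(host: str, port: int, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def check_all(endpoints=None) -> dict:
    """Map each service name to whether its port is reachable."""
    endpoints = ENDPOINTS if endpoints is None else endpoints
    return {name: port_is_open(host, port) for name, (host, port) in endpoints.items()}


# Example usage (run from a machine on the same network):
#   for name, ok in check_all().items():
#       print(f"{name}: {'reachable' if ok else 'unreachable'}")
```

An open port shows that something is listening, not that the service is healthy, so treat this as a first filter only.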

✨ Special Features

Interactive Documentation

✅ Code examples in multiple languages
✅ Copy-paste ready templates
✅ Health check commands
✅ Troubleshooting guides
✅ Best practices sections

Comprehensive Coverage

✅ All services documented
✅ All endpoints with examples
✅ Network diagrams with ASCII art
✅ Performance data from real tests
✅ Error handling patterns

This index provides a bird's-eye view of all ProYaro documentation. Use it to navigate quickly to what you need!

Index Version: 1.0
Last Updated: 2025-01-01
Total Documentation: ~8,000+ lines across 8+ files

🆕 Skills Added (Updated 2025-01-01)

Core AI Services

Skill File	Service	Lines	Languages	Status
`mlx-chat-skill.md`	MLX Text Gen	400+	TS/Python	✅ Complete
`comfyui-image-skill.md`	Image Generation	500+	TS/Python	✅ Complete
`whisper-stt-skill.md`	Speech-to-Text	450+	TS/Python	✅ Complete
`tts-skill.md`	Text-to-Speech	100+	TS	✅ Complete
`embeddings-skill.md`	Semantic Search	150+	TS	✅ Complete
`job-management-skill.md`	Job Queue API	550+	TS/Python	✅ Complete

Infrastructure & DevOps

Skill File	Technology	Lines	Languages	Status
`caddy-reverse-proxy-skill.md`	HTTPS/Proxy	150+	Caddyfile/Bash	✅ Complete
`docker-compose-skill.md`	Containers	130+	YAML/Bash	✅ Complete
`fastapi-production-skill.md`	Python API	270+	Python	✅ Complete
`redis-queue-skill.md`	Job Queue	280+	Python	✅ Complete

Domain-Specific

Skill File	Specialty	Lines	Languages	Status
`nextjs-pwa-skill.md`	React/PWA	300+	TS/React	✅ Complete
`arabic-nlp-skill.md`	Arabic NLP	270+	Python	✅ Complete

Total Skills: 12
Total Lines: 4,797
Total Code Examples: 100+

📝 Recently Added Features

What's New in Version 1.1

✅ ComfyUI Image Generation - Full Z-Image Turbo integration
✅ Whisper STT - Complete voice transcription with React hooks
✅ XTTS-v2 TTS - Arabic voice synthesis
✅ Embeddings & Search - Both MLX and E5-Large implementations
✅ Caddy Integration - Reverse proxy and HTTPS setup
✅ Copied Ubuntu Skills - FastAPI, Next.js, Docker, Redis, Arabic NLP
✅ All skills use `docker compose` (the current Compose V2 syntax) rather than the legacy `docker-compose` command

Skills from Ubuntu Server

The following skills were imported from /mnt/storage/new-stack/skills/ and adapted:

fastapi-production-skill.md - Production FastAPI patterns
nextjs-pwa-skill.md - Next.js 15 App Router + PWA
docker-compose-skill.md - Docker orchestration
redis-queue-skill.md - Background job processing
arabic-nlp-skill.md - Arabic text processing

🎯 Quick Skill Selector

I want to...

Build AI Features

Text generation (chatbot) → mlx-chat-skill.md
Generate images → comfyui-image-skill.md
Voice transcription → whisper-stt-skill.md
Voice synthesis → tts-skill.md
Semantic search → embeddings-skill.md

Build Infrastructure

API backend → fastapi-production-skill.md
Frontend app → nextjs-pwa-skill.md
Job queue → redis-queue-skill.md + job-management-skill.md
HTTPS proxy → caddy-reverse-proxy-skill.md
Container orchestration → docker-compose-skill.md

Work with Arabic

Arabic NLP → arabic-nlp-skill.md
Arabic voice → whisper-stt-skill.md + tts-skill.md
Arabic text generation → mlx-chat-skill.md
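
The selector above can also be expressed as a simple lookup, for example when an assistant or script needs to resolve a task to its skill files. This is a sketch only: the mapping mirrors the lists above, and the task keys and the `skills_for` helper are names invented for the example.

```python
# Task-to-skill mapping copied from the selector lists in this index.
SKILLS_BY_TASK = {
    "text generation": ["mlx-chat-skill.md"],
    "generate images": ["comfyui-image-skill.md"],
    "voice transcription": ["whisper-stt-skill.md"],
    "voice synthesis": ["tts-skill.md"],
    "semantic search": ["embeddings-skill.md"],
    "api backend": ["fastapi-production-skill.md"],
    "frontend app": ["nextjs-pwa-skill.md"],
    "job queue": ["redis-queue-skill.md", "job-management-skill.md"],
    "https proxy": ["caddy-reverse-proxy-skill.md"],
    "container orchestration": ["docker-compose-skill.md"],
    "arabic nlp": ["arabic-nlp-skill.md"],
    "arabic voice": ["whisper-stt-skill.md", "tts-skill.md"],
    "arabic text generation": ["mlx-chat-skill.md"],
}


def skills_for(task: str) -> list:
    """Return the skill files for a task, or an empty list if the task is unknown."""
    return SKILLS_BY_TASK.get(task.strip().lower(), [])


print(skills_for("Job queue"))  # ['redis-queue-skill.md', 'job-management-skill.md']
```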

🤖 Agents (Role-Based Definitions)

NEW! Specialized agent personas, adapted from the quiz-night-in project for the ProYaro infrastructure.

What are Agents?

Unlike skills (copy-paste code templates), agents are role-based personas that define:

Specific responsibilities and boundaries
Decision-making frameworks
Integration patterns
Best practices for a domain

Available Agents

Agent File	Role	Responsibilities	Use When
`ai-integration-agent.md`	AI Services Engineer	Integrate MLX, ComfyUI, Whisper, XTTS	Building AI-powered features
`frontend-developer-agent.md`	Frontend Developer	React, shadcn/ui, Next.js, Tailwind	Building user interfaces
`database-architect-agent.md`	Database Architect	Drizzle ORM, PostgreSQL schemas	Designing database structure
`auth-expert-agent.md`	Auth Specialist	BetterAuth, user management, security	Implementing authentication
`realtime-integration-agent.md`	Real-Time Engineer	WebSocket, job queue, async tasks	Building real-time features

Agents vs Skills

Feature	Agents	Skills
Purpose	Define role and boundaries	Provide code templates
Content	Principles, patterns, examples	Copy-paste ready code
Length	~500-800 lines	~100-500 lines
Usage	Reference for decision-making	Direct implementation

Example: Building a Chatbot App

Using Skills:

Copy code from mlx-chat-skill.md
Paste into your project
Modify as needed

Using Agents:

ai-integration-agent → Guides MLX API integration, conversation management
frontend-developer-agent → Guides React component structure, state management
database-architect-agent → Guides conversation/message schema design
auth-expert-agent → Guides user authentication and authorization

When to Use Agents

Planning Phase: Read relevant agents to understand best practices
Decision Points: Consult agents when choosing between approaches
Code Review: Verify your implementation matches agent guidelines
Team Collaboration: Share agents to align on architectural patterns

Total Agents: 5
Total Lines: ~3,000
Adapted From: quiz-night-in project (specialized for ProYaro stack)

Index Updated: 2025-01-01 (v1.2)
Total Documentation: 13,000+ lines
Total Files: Core docs (4) + Skills (12) + Agents (5) = 21 files

ProYaro AI Infrastructure Documentation • Version 1.2