diff --git a/README.md b/README.md
index 71785a1..ed6d2f5 100644
--- a/README.md
+++ b/README.md
@@ -23,13 +23,13 @@
 ```mermaid
 flowchart TD
- MIC["🎤 麦克风"] --> STT["Faster-Whisper\n中文语音识别"]
- STT --> RULE{"规则解析引擎\n简单指令匹配"}
- RULE -- "命中\n松开 / 复位 / 方向移动" --> ACT["JSON 动作指令"]
- RULE -- "未命中\n含物体名的复杂指令" --> LLM["DeepSeek-R1-1.5B\nQLoRA FP16\n自然语言 → JSON"]
+ MIC["🎤 麦克风"] --> STT["Faster-Whisper
中文语音识别"]
+ STT --> RULE{"规则解析引擎
简单指令匹配"}
+ RULE -- "命中" --> ACT["JSON 动作指令"]
+ RULE -- "未命中(含物体名)" --> LLM["DeepSeek-R1-1.5B
QLoRA FP16
自然语言 → JSON"]
 LLM --> ACT
- ACT --> VIS["YOLOv8s + Homography\n目标检测 · 手眼标定\n像素坐标 → 机械臂坐标 mm"]
- VIS --> MOT["arm_main.py\nD-H 逆运动学 + S-Curve"]
+ ACT --> VIS["YOLOv8s + Homography
目标检测 · 手眼标定
像素坐标 → 机械臂坐标 mm"]
+ VIS --> MOT["arm_main.py
D-H IK + S-Curve"]
 MOT --> ESP["ESP32 PWM → 舵机"]
 ```
diff --git a/README_EN.md b/README_EN.md
index 67a1e62..9f308eb 100644
--- a/README_EN.md
+++ b/README_EN.md
@@ -23,13 +23,13 @@ Total hardware cost **¥317 (~$45 USD)**. Requires an NVIDIA GPU for LLM inferen
 ```mermaid
 flowchart TD
- MIC["🎤 Microphone"] --> STT["Faster-Whisper\nChinese speech recognition"]
- STT --> RULE{"Regex engine\nSimple command match"}
- RULE -- "Hit\nrelease / reset / directional" --> ACT["JSON action"]
- RULE -- "Miss\ncomplex command with object name" --> LLM["DeepSeek-R1-1.5B\nQLoRA FP16\nNatural language → JSON"]
+ MIC["🎤 Microphone"] --> STT["Faster-Whisper
Chinese speech recognition"]
+ STT --> RULE{"Regex engine
Simple command match"}
+ RULE -- "Hit" --> ACT["JSON action"]
+ RULE -- "Miss (has object name)" --> LLM["DeepSeek-R1-1.5B
QLoRA FP16
Natural language → JSON"]
 LLM --> ACT
- ACT --> VIS["YOLOv8s + Homography\nObject detection · hand-eye calibration\nPixel coords → robot coords mm"]
- VIS --> MOT["arm_main.py\nD-H IK + S-Curve trajectory"]
+ ACT --> VIS["YOLOv8s + Homography
Object detection · hand-eye calibration
Pixel coords → robot coords mm"]
+ VIS --> MOT["arm_main.py
D-H IK + S-Curve"]
 MOT --> ESP["ESP32 PWM → Servos"]
 ```
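
For readers of the flowchart, the "Hit" / "Miss" branch between the regex engine and the LLM could look roughly like the sketch below. It is only an illustration under stated assumptions: the function names (`match_rule`, `route`, `fake_llm`), the English patterns, and the JSON fields are all hypothetical and not taken from the repository. The only parts that come from the diagram are the rule categories (release, reset, directional moves) and the fallback to an LLM for commands that name an object.

```python
import json
import re
from typing import Callable, Optional


def match_rule(text: str) -> Optional[dict]:
    """Return an action dict for a simple command, or None on a miss.

    Patterns and field names are hypothetical placeholders.
    """
    text = text.lower()
    if re.search(r"\b(release|let go)\b", text):
        return {"action": "release"}
    if re.search(r"\breset\b", text):
        return {"action": "reset"}
    m = re.search(r"\bmove (left|right|forward|back)\b", text)
    if m:
        return {"action": "move", "direction": m.group(1)}
    return None  # miss: likely a complex command that names an object


def route(text: str, llm_fallback: Callable[[str], dict]) -> str:
    """Try the rules first; call the LLM only when no rule matches."""
    action = match_rule(text)
    if action is None:
        action = llm_fallback(text)
    return json.dumps(action, ensure_ascii=False)


def fake_llm(text: str) -> dict:
    """Stand-in for the fine-tuned model; returns a fixed placeholder."""
    return {"action": "pick", "object": "unknown"}


if __name__ == "__main__":
    print(route("please reset", fake_llm))
    print(route("grab the red block", fake_llm))
```

Keeping the rule check ahead of the model call means simple commands never pay the LLM inference cost, which appears to be the intent of the two labelled edges in the diagram.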