robot_arm

m1ngsama/robot_arm

Fork 0

mirror of https://github.com/m1ngsama/robot_arm.git synced 2026-03-25 19:53:49 +00:00

Commit graph

Author	SHA1	Message	Date
m1ngsama	f2ad220cc8	refactor(audio): move full pipeline into RobotEar.get_text(); add config constants config.py: - Add AUDIO_SILENCE_THRESHOLD, AUDIO_SILENCE_MARGIN, AUDIO_MIN_DURATION, AUDIO_MAX_DURATION so all audio tunables live in one place whisper_main.py: - RobotEar.get_text() now owns the full pipeline: silence trimming, duration guards, WAV write, Whisper transcription with all options - _fix_recognition() moved here from RobotApp (ASR post-processing belongs in the ear layer, not the application layer) - Add `import re`, `import config`; remove unused `sounddevice` import voice_main.py: - Remove `import scipy.io.wavfile` (WAV handling moved to whisper_main) - get_audio_text() is now a one-liner: return self.ear.get_text(self.audio_frames) - Remove _fix_recognition() (lives in RobotEar now) Closes #9	2026-02-20 21:45:16 +08:00
m1ngsama	54c1dc8c01	refactor: extract magic constants to config.py (closes #7 ) Create config.py with env-var-overridable hardware constants: - SERIAL_PORT / SERIAL_BAUD - LLM_MODEL_PATH / YOLO_MODEL_PATH - Z_HOVER / Z_GRAB / Z_AFTER_PICK (Z_TABLE eliminated — same as Z_GRAB) - REST_X / REST_Y / REST_Z (replaces three copies of [120,0,60]) - WS_X / WS_Y / WS_Z (workspace clipping bounds) - DAMPING_BUFFER_SIZE / DAMPING_MAX_SPEED / DAMPING_FACTOR - OFFSET_Y / OFFSET_Z arm_main.py: port/baud/damping defaults now read from config. voice_main.py: all magic constants replaced; remaining Chinese non-i18n strings translated to English. .gitignore: whitelist config.py.	2026-02-20 20:56:52 +08:00

Author

SHA1

Message

Date

m1ngsama

f2ad220cc8

refactor(audio): move full pipeline into RobotEar.get_text(); add config constants

config.py:
- Add AUDIO_SILENCE_THRESHOLD, AUDIO_SILENCE_MARGIN, AUDIO_MIN_DURATION,
  AUDIO_MAX_DURATION so all audio tunables live in one place

whisper_main.py:
- RobotEar.get_text() now owns the full pipeline: silence trimming,
  duration guards, WAV write, Whisper transcription with all options
- _fix_recognition() moved here from RobotApp (ASR post-processing
  belongs in the ear layer, not the application layer)
- Add `import re`, `import config`; remove unused `sounddevice` import

voice_main.py:
- Remove `import scipy.io.wavfile` (WAV handling moved to whisper_main)
- get_audio_text() is now a one-liner: return self.ear.get_text(self.audio_frames)
- Remove _fix_recognition() (lives in RobotEar now)

Closes #9

2026-02-20 21:45:16 +08:00

m1ngsama

54c1dc8c01

refactor: extract magic constants to config.py (closes #7 )

Create config.py with env-var-overridable hardware constants:
- SERIAL_PORT / SERIAL_BAUD
- LLM_MODEL_PATH / YOLO_MODEL_PATH
- Z_HOVER / Z_GRAB / Z_AFTER_PICK  (Z_TABLE eliminated — same as Z_GRAB)
- REST_X / REST_Y / REST_Z         (replaces three copies of [120,0,60])
- WS_X / WS_Y / WS_Z              (workspace clipping bounds)
- DAMPING_BUFFER_SIZE / DAMPING_MAX_SPEED / DAMPING_FACTOR
- OFFSET_Y / OFFSET_Z

arm_main.py: port/baud/damping defaults now read from config.
voice_main.py: all magic constants replaced; remaining Chinese
non-i18n strings translated to English.
.gitignore: whitelist config.py.

2026-02-20 20:56:52 +08:00

2 commits