robot_arm

m1ngsama/robot_arm

Fork 0

mirror of https://github.com/m1ngsama/robot_arm.git synced 2026-03-25 19:53:49 +00:00

Commit graph

Author	SHA1	Message	Date
m1ngsama	f2ad220cc8	refactor(audio): move full pipeline into RobotEar.get_text(); add config constants config.py: - Add AUDIO_SILENCE_THRESHOLD, AUDIO_SILENCE_MARGIN, AUDIO_MIN_DURATION, AUDIO_MAX_DURATION so all audio tunables live in one place whisper_main.py: - RobotEar.get_text() now owns the full pipeline: silence trimming, duration guards, WAV write, Whisper transcription with all options - _fix_recognition() moved here from RobotApp (ASR post-processing belongs in the ear layer, not the application layer) - Add `import re`, `import config`; remove unused `sounddevice` import voice_main.py: - Remove `import scipy.io.wavfile` (WAV handling moved to whisper_main) - get_audio_text() is now a one-liner: return self.ear.get_text(self.audio_frames) - Remove _fix_recognition() (lives in RobotEar now) Closes #9	2026-02-20 21:45:16 +08:00
m1ngsama	6977061bef	fix(whisper): remove broken start_recording; move scipy import to top-level whisper_main.py: - Remove RobotEar.start_recording() and record_callback() which called the nonexistent sd.start_stream() API (correct API is sd.InputStream). These methods were never called by voice_main.py and contained a broken sounddevice API call that would raise AttributeError (#2). - Remove unused recording_buffer field - Translate Chinese comment/docstring to English (#5) voice_main.py: - Move `import scipy.io.wavfile as wav` from inside get_audio_text() function body to module top-level where all imports belong (#4 related) - Sort imports: stdlib before third-party, local last - Remove Chinese comment, replace with English equivalent	2026-02-20 20:24:24 +08:00
whisper11111111111	bb85c3266b	Initial commit for robot arm voice control system	2026-02-10 23:31:14 +08:00

Author

SHA1

Message

Date

m1ngsama

f2ad220cc8

refactor(audio): move full pipeline into RobotEar.get_text(); add config constants

config.py:
- Add AUDIO_SILENCE_THRESHOLD, AUDIO_SILENCE_MARGIN, AUDIO_MIN_DURATION,
  AUDIO_MAX_DURATION so all audio tunables live in one place

whisper_main.py:
- RobotEar.get_text() now owns the full pipeline: silence trimming,
  duration guards, WAV write, Whisper transcription with all options
- _fix_recognition() moved here from RobotApp (ASR post-processing
  belongs in the ear layer, not the application layer)
- Add `import re`, `import config`; remove unused `sounddevice` import

voice_main.py:
- Remove `import scipy.io.wavfile` (WAV handling moved to whisper_main)
- get_audio_text() is now a one-liner: return self.ear.get_text(self.audio_frames)
- Remove _fix_recognition() (lives in RobotEar now)

Closes #9

2026-02-20 21:45:16 +08:00

m1ngsama

6977061bef

fix(whisper): remove broken start_recording; move scipy import to top-level

whisper_main.py:
- Remove RobotEar.start_recording() and record_callback() which called
  the nonexistent sd.start_stream() API (correct API is sd.InputStream).
  These methods were never called by voice_main.py and contained a broken
  sounddevice API call that would raise AttributeError (#2).
- Remove unused recording_buffer field
- Translate Chinese comment/docstring to English (#5)

voice_main.py:
- Move `import scipy.io.wavfile as wav` from inside get_audio_text()
  function body to module top-level where all imports belong (#4 related)
- Sort imports: stdlib before third-party, local last
- Remove Chinese comment, replace with English equivalent

2026-02-20 20:24:24 +08:00

whisper11111111111

bb85c3266b

Initial commit for robot arm voice control system

2026-02-10 23:31:14 +08:00

3 commits