Voice LLM Application - Experiment 05: Multimodal Document Analysis - Voice Dialogue
Experiment Preparation:
- Ensure Doubao AI (Volcengine) and iFlytek AI are connected (refer to Experiment 01, Experiment 02)
- Prepare a document to use as experiment material. The document can be given as a relative or an absolute path; the relative path defaults to AI_online_voice/assets/text.docx
- Install the required dependencies (skip if already installed):
    pip install python-docx
    pip install openpyxl
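python-docx and openpyxl read Word and Excel files respectively. Below is a minimal sketch of how text might be pulled out of a .docx or .xlsx file before it is passed to the model. The names `read_docx`, `read_xlsx`, `READERS` and `extract_text` are hypothetical and are not taken from the example program, which may handle documents differently.

```python
from pathlib import Path


def read_docx(path):
    """Return the non-empty paragraphs of a Word document (needs python-docx)."""
    from docx import Document  # imported lazily so this module loads without it

    document = Document(str(path))
    return "\n".join(p.text for p in document.paragraphs if p.text.strip())


def read_xlsx(path):
    """Return worksheet rows as tab-separated lines (needs openpyxl)."""
    from openpyxl import load_workbook

    workbook = load_workbook(str(path), read_only=True, data_only=True)
    lines = []
    for sheet in workbook.worksheets:
        lines.append(f"[{sheet.title}]")  # mark where each sheet starts
        for row in sheet.iter_rows(values_only=True):
            cells = ["" if value is None else str(value) for value in row]
            if any(cells):  # skip rows that are completely empty
                lines.append("\t".join(cells))
    workbook.close()
    return "\n".join(lines)


# Hypothetical mapping from file extension to reader function.
READERS = {".docx": read_docx, ".xlsx": read_xlsx}


def extract_text(path):
    """Pick a reader by file extension and return the document as plain text."""
    path = Path(path)
    reader = READERS.get(path.suffix.lower())
    if reader is None:
        raise ValueError(f"Unsupported document type: {path.suffix or '(none)'}")
    return reader(path)
```

Calling `extract_text("assets/text.docx")` from the main directory would return the document's paragraphs as one string, provided the file exists and python-docx is installed.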
Experiment Steps: (Ensure the voice module is connected)
    cd AI_online_voice                              # Enter the main directory
    python examples/05_voice_document_analysis.py   # Run the example program
- After the program starts, follow the terminal prompts: first enter y to go to document selection, then give the document as an absolute or a relative path.
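The path handling described in the preparation notes can be sketched as follows. This is an assumption about the behaviour, not the example program's actual code: `resolve_document_path` and `DEFAULT_DOCUMENT` are hypothetical names, empty input is assumed to select the default document, and relative paths are assumed to be resolved against the AI_online_voice main directory.

```python
from pathlib import Path

# Default document, relative to the AI_online_voice main directory.
DEFAULT_DOCUMENT = Path("assets") / "text.docx"


def resolve_document_path(user_input, base_dir):
    """Turn the path typed at the prompt into a full path.

    Assumptions (not confirmed by the example program):
    - empty input falls back to DEFAULT_DOCUMENT
    - relative paths are resolved against base_dir
    - absolute paths are used unchanged
    """
    raw = user_input.strip().strip("\"'")  # drop whitespace and pasted quotes
    candidate = Path(raw).expanduser() if raw else DEFAULT_DOCUMENT
    if not candidate.is_absolute():
        candidate = Path(base_dir) / candidate
    return candidate
```

For example, `resolve_document_path("", Path.cwd())` run from inside AI_online_voice points at assets/text.docx under that directory. A caller would still need to check `is_file()` on the result before opening it.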
Terminal Operation Example:


Experiment Effect: Document upload → multimodal LLM document analysis → voice dialogue output, combining document understanding with voice interaction
