Voice LLM Application - Experiment 05: Multimodal Document Analysis - Voice Dialogue

Experiment Preparation:

Ensure Doubao AI (Volcengine) and iFlytek AI are connected (see Experiment 01 and Experiment 02).
Prepare a document to use as experiment material. The document can be given as a relative or an absolute path; the relative path defaults to AI_online_voice/assets/text.docx.
Install the required dependencies (skip this step if they are already installed):

pip install python-docx
pip install openpyxl
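
The two packages above read Word (.docx) and Excel (.xlsx) files. As a minimal sketch of how document text might be extracted before it is sent to the model: the helper name extract_text and the set of supported extensions are assumptions for illustration and are not taken from the example script.

```python
from pathlib import Path


def extract_text(path: str) -> str:
    """Return the plain text of a .docx or .xlsx file (hypothetical helper)."""
    suffix = Path(path).suffix.lower()
    if suffix == ".docx":
        from docx import Document  # provided by python-docx
        doc = Document(path)
        # Keep non-empty paragraphs only; text inside tables is not included.
        return "\n".join(p.text for p in doc.paragraphs if p.text.strip())
    if suffix == ".xlsx":
        from openpyxl import load_workbook
        # read_only avoids loading the whole workbook; data_only returns
        # cached cell values instead of formulas.
        wb = load_workbook(path, read_only=True, data_only=True)
        lines = []
        for ws in wb.worksheets:
            lines.append(f"[{ws.title}]")
            for row in ws.iter_rows(values_only=True):
                cells = [str(c) for c in row if c is not None]
                if cells:
                    lines.append("\t".join(cells))
        wb.close()
        return "\n".join(lines)
    raise ValueError(f"Unsupported document type: {suffix or path}")
```

The libraries are imported inside each branch so that a missing package only matters for the file type that needs it.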

Experiment Steps: (ensure the voice module is connected)

cd AI_online_voice # Enter main directory
python examples/05_voice_document_analysis.py # Run example program
Once the program starts, follow the terminal prompts: first enter y to go to document selection, then provide the document as either an absolute or a relative path.
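
One way the entered path could be resolved is sketched below. It assumes that an empty input falls back to assets/text.docx and that relative paths are resolved against the AI_online_voice directory; the name resolve_document_path is hypothetical, and the example script may behave differently.

```python
from pathlib import Path

# Default document, relative to the project root (from the preparation notes).
DEFAULT_RELATIVE = Path("assets/text.docx")


def resolve_document_path(user_input: str, project_root: Path) -> Path:
    """Turn the path typed at the prompt into a full path (hypothetical helper)."""
    # Strip whitespace and surrounding quotes, which often appear when a
    # path is pasted or dragged into the terminal.
    text = user_input.strip().strip('"').strip("'")
    candidate = Path(text).expanduser() if text else DEFAULT_RELATIVE
    if not candidate.is_absolute():
        # Assumption: relative paths are relative to the main directory.
        candidate = project_root / candidate
    return candidate
```

For example, with project_root set to the AI_online_voice directory, an input of assets/report.docx resolves to a file inside that directory, while an absolute path is returned unchanged.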

Terminal Operation Example:

Experiment Effect: Document upload → multimodal LLM document analysis → voice dialogue output. Together these steps implement document understanding with voice interaction.
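
The flow above can be outlined in code. This is only a sketch under stated assumptions: analyze and speak are hypothetical callables standing in for the Doubao model request and the iFlytek speech output, and the prompt layout and the max_chars limit are illustrative choices, not the example script's actual behavior.

```python
from typing import Callable


def run_document_dialogue(
    document_text: str,
    question: str,
    analyze: Callable[[str], str],  # stand-in for the LLM request
    speak: Callable[[str], None],   # stand-in for the speech output
    max_chars: int = 4000,          # assumed limit to keep the prompt short
) -> str:
    """Send a document excerpt plus a question to the model, then voice the answer."""
    excerpt = document_text[:max_chars]
    prompt = f"Document:\n{excerpt}\n\nQuestion: {question}"
    answer = analyze(prompt)
    speak(answer)
    return answer


if __name__ == "__main__":
    # Stubs so the sketch runs without any cloud service or audio device.
    run_document_dialogue(
        "Quarterly sales rose in the north region.",
        "What does the document say about sales?",
        analyze=lambda prompt: f"(model reply to {len(prompt)} characters)",
        speak=print,
    )
```

Passing the two services in as callables keeps the flow testable offline; the real connections from Experiments 01 and 02 would replace the stubs.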