Here are entries from the log:
10:10:09:717
[LOG]
Transcriber output: My name is Stewart,
10:10:09:732
[LOG]
Endpointing timeout 2496ms
10:10:10:280
[CHECKPOINT]
Assistant speech started
10:10:10:280
[INFO]
Turn latency: 562ms (transcriber: 0ms, endpointing: 0ms, model: 0ms, voice: 0ms)
10:10:12:230
[CHECKPOINT]
Model request started
10:10:12:233
[LOG]
Model request started (attempt #1, gpt-4o-mini-2024-07-18, azure-openai, eastus)
10:10:12:439
[LOG]
Model output: Thanks
10:10:12:439
[CHECKPOINT]
Model sent first output token
10:10:12:439
[CHECKPOINT]
Model sent start token
10:10:12:446
[LOG]
Model output: ,
10:10:12:450
[LOG]
Model output: Stewart
10:10:12:457
[LOG]
Model output: !
10:10:12:465
[LOG]
Model output: Could
10:10:12:468
[LOG]
Model output: you
10:10:12:476
[LOG]
Model output: please
10:10:12:481
[LOG]
Model output: spell
10:10:12:489
[LOG]
Model output: that
10:10:12:497
[LOG]
Model output: for
10:10:12:499
[LOG]
Model output: me
10:10:12:507
[LOG]
Model output: ?
10:10:12:514
[LOG]
Model request cost (attempt #1, $0.0003741, 2446 prompt, 12 completion)
10:10:12:514
[LOG]
Voice input: Thanks, Stewart! Could you please spell that for me?
10:10:12:514
[CHECKPOINT]
Model sent end token
10:10:16:407
[CHECKPOINT]
Assistant speech stopped
But the transcript shows: