Timestamp of the end of the segment, in milliseconds.
Length of the segment, in milliseconds.
The id of the speaker.
Timestamp of the beginning of the segment, in milliseconds.
Concatenated words of the segment.
Words decoded for the segment.
Timestamp of the end of the segment, in milliseconds.