DARPA Communicator Testbed


Log Standard Implementation of DMA Metrics (v4)

Contents


Introduction

The purpose of this document is to provide some guidance in logging such that we all collect the appropriate metrics for evaluation. Most of the metrics rely on type_... attributes (XML tag attributes beginning with type_) and their values. In most cases these attributes may be assigned to one of a few tag types (as opposed to being restricted to a single tag type), to allow some flexibility in accomodating different systems. Human annotations should be stored in a separate XML file external to the main logfile, to facilitate multiple annotators and measurement of interannotator agreement. We envision that each log with reside in its own directory which also contains the associated human annotations as well as the audio (wav) files associated with the dialogue. The main log file should contain pointers to the audio (wav) files. Below are our specific recommendations for placement and naming of files associated with a call:


DMA Metrics

Task Completion

Time to Completion

Turns to Completion

Mean User Words/Turn

Mean System Words/Turn

Mean User Concepts/Turn

Mean Concept Efficiency

State of Itinerary

Error Messages

Help Messages

Response Latency

User Words to Completion

System Words to Completion

User Repeats

System Repeats/Reprompts

Word Error

Mean Length of System Utterance

Mean Length of System Turn


Summary of Attributes and Metrics

Attribute Values Metrics Tags
type_start_task total
task
true
Time to Completion
User Words to Completion
System Words to Completion
Turns to Completion
GC_OPERATION
GC_MESSAGE
GC_EVENT
type_end_task total
task
true
Time to Completion
User Words to Completion
System Words to Completion
Turns to Completion
GC_OPERATION
GC_MESSAGE
GC_EVENT
type_new_turn user
system
Turns to Completion
Mean User Words/Turn (user)
Mean System Words/Turn (system)
Mean Length of System Turn (system)
GC_OPERATION
GC_MESSAGE
GC_EVENT
type_utt_text transcription
system
asr
Mean User Words/Turn (transcription)
Mean System Words/Turn (system)
Error Messages (system)
Help Messages (system)
User Words to Completion (transcription)
System Words to Completion (system)
Word Error (transcription, asr)
GC_DATA
type_error_msg true Error Messages GC_DATA
type_help_msg true Help Messages GC_DATA
type_start_utt user
system
Response Latency (system)
Mean Length of System Utterance (system)
GC_OPERATION
GC_MESSAGE
GC_EVENT
type_end_utt user
system
Response Latency (user)
Mean Length of System Utterance (system)
GC_OPERATION
GC_MESSAGE
GC_EVENT
type_prompt (key being prompted for) System Repeats/Reprompts GC_OPERATION
GC_MESSAGE
GC_EVENT