-
Notifications
You must be signed in to change notification settings - Fork 11
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Create 2022-07-26-phone-number-capture-dataset.md #108
base: master
Are you sure you want to change the base?
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Overall structure is there - similar to this. But:
- the content is missing is some places, and
- needs more emphasis on the metrics and how to read them
|
||
# Introduction | ||
|
||
A bottleneck of a dialogue system is its ability to extract information from the utterances. The information is extracted in the form of **frames**, that represents all the different types of intentions that the system can extract from user utterances and **slots**, that are the different type of possible values. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
is frames the same as intents ? lets use that to be consistent
|
||
## Sentence Error Rate | ||
|
||
Sentence Error Rate is a robust extension to WER(Word Error Metric), used to evaluate the working of an ASR system. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Need more on how this is defined
1. **System performance** - this is captured through entity and slot metrics. in the case of alphanumeric here, we focus on SER (Sentence Error Rate) | ||
2. **User experience** - this is captured through a subjective UX score, assigned through analysis (by the CUX function) | ||
|
||
We expect, system performance (≈SER) to be better for entities captured across two turns - since we are parsing smaller sub-entities independently and (naively) expect SER to be a function of CER (Character Error Rate) and length. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
If we keep this note, then it needs more explanation. What is SER and why is it a function of these two ? seems like some assumptions are being skipped
|
||
Sentence Error Rate is a robust extension to WER(Word Error Metric), used to evaluate the working of an ASR system. | ||
|
||
We averaged the calculations across different callers, for each variation across different ASR systems. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
better sentence construction
We averaged the calculations across different callers, for each variation across different ASR systems. | ||
![image](https://user-images.githubusercontent.com/16001446/180979426-e4ddc17f-a5e6-4af6-9659-e3f4e5166fac.png) | ||
|
||
## UX Score |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This section needs a lot more! Can start with a summary of the UX report
> Single Turn is a more natural way to collect a phone number than Two turns. | ||
|
||
# Future Work | ||
## Validate on a larger dataset |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
lets skip this, and add confidence intervals instead
For example, for the above conversation to book a flight, the frame will be of the type: | ||
|
||
![image](https://user-images.githubusercontent.com/16001446/180951715-d0169425-8f17-443e-a823-23dff1e5af46.png) | ||
# Motivation |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can this be merged with the previous section ? Introduction
isnt saying much really
categories: [Machine Learning] | ||
image: assets/images/demo1.jpg | ||
layout: post | ||
authors: [anirudhthatipelli] |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
adding adithyanarayan
as a co-author since he worked on all the evaluations here
No description provided.