Create 2022-07-26-phone-number-capture-dataset.md #108

Anirudh257 · 2022-07-26T10:28:00Z

No description provided.

janaab11

Overall structure is there - similar to this. But:

the content is missing is some places, and
needs more emphasis on the metrics and how to read them

janaab11 · 2022-08-29T07:47:29Z

_posts/2022-07-26-phone-number-capture-dataset.md

+
+# Introduction
+
+A bottleneck of a dialogue system is its ability to extract information from the utterances.  The information is extracted in the form of **frames**, that represents all the different types of intentions that the system can extract from user utterances and **slots**, that are the different type of possible values.


is frames the same as intents ? lets use that to be consistent

janaab11 · 2022-08-29T07:49:06Z

_posts/2022-07-26-phone-number-capture-dataset.md

+
+## Sentence Error Rate
+
+Sentence Error Rate is a robust extension to WER(Word Error Metric), used to evaluate the working of an ASR system. 


Need more on how this is defined

janaab11 · 2022-08-29T07:50:09Z

_posts/2022-07-26-phone-number-capture-dataset.md

+1. **System performance** - this is captured through entity and slot metrics. in the case of alphanumeric here, we focus on SER (Sentence Error Rate)
+2. **User experience** - this is captured through a subjective UX score, assigned through analysis (by the CUX function)
+
+We expect, system performance (≈SER) to be better for entities captured across two turns - since we are parsing smaller sub-entities independently and (naively) expect SER to be a function of CER (Character Error Rate) and length.


If we keep this note, then it needs more explanation. What is SER and why is it a function of these two ? seems like some assumptions are being skipped

janaab11 · 2022-08-29T07:50:36Z

_posts/2022-07-26-phone-number-capture-dataset.md

+
+Sentence Error Rate is a robust extension to WER(Word Error Metric), used to evaluate the working of an ASR system. 
+
+We averaged the calculations across different callers, for each variation across different ASR systems.


better sentence construction

janaab11 · 2022-08-29T07:51:17Z

_posts/2022-07-26-phone-number-capture-dataset.md

+We averaged the calculations across different callers, for each variation across different ASR systems.
+![image](https://user-images.githubusercontent.com/16001446/180979426-e4ddc17f-a5e6-4af6-9659-e3f4e5166fac.png)
+
+## UX Score


This section needs a lot more! Can start with a summary of the UX report

janaab11 · 2022-08-29T07:51:50Z

_posts/2022-07-26-phone-number-capture-dataset.md

+> Single Turn is a more natural way to collect a phone number than Two turns.
+
+# Future Work
+## Validate on a larger dataset


lets skip this, and add confidence intervals instead

janaab11 · 2022-08-29T07:52:58Z

_posts/2022-07-26-phone-number-capture-dataset.md

+For example, for the above conversation to book a flight, the frame will be of the type:
+
+![image](https://user-images.githubusercontent.com/16001446/180951715-d0169425-8f17-443e-a823-23dff1e5af46.png)
+# Motivation


Can this be merged with the previous section ? Introduction isnt saying much really

janaab11 · 2022-08-29T07:59:42Z

_posts/2022-07-26-phone-number-capture-dataset.md

+categories: [Machine Learning]
+image: assets/images/demo1.jpg
+layout: post
+authors: [anirudhthatipelli]


adding adithyanarayan as a co-author since he worked on all the evaluations here

Create 2022-07-26-phone-number-capture-dataset.md

9b4daba

swarajdalmia requested a review from janaab11 August 1, 2022 09:52

janaab11 suggested changes Aug 29, 2022

View reviewed changes

janaab11 reviewed Aug 29, 2022

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Create 2022-07-26-phone-number-capture-dataset.md #108

Create 2022-07-26-phone-number-capture-dataset.md #108

Anirudh257 commented Jul 26, 2022

janaab11 left a comment

janaab11 Aug 29, 2022

janaab11 Aug 29, 2022

janaab11 Aug 29, 2022

janaab11 Aug 29, 2022

janaab11 Aug 29, 2022

janaab11 Aug 29, 2022

janaab11 Aug 29, 2022

janaab11 Aug 29, 2022 •

edited

Loading


		# Introduction

		A bottleneck of a dialogue system is its ability to extract information from the utterances. The information is extracted in the form of frames, that represents all the different types of intentions that the system can extract from user utterances and slots, that are the different type of possible values.


		## Sentence Error Rate

		Sentence Error Rate is a robust extension to WER(Word Error Metric), used to evaluate the working of an ASR system.


		Sentence Error Rate is a robust extension to WER(Word Error Metric), used to evaluate the working of an ASR system.

		We averaged the calculations across different callers, for each variation across different ASR systems.

Create 2022-07-26-phone-number-capture-dataset.md #108

Are you sure you want to change the base?

Create 2022-07-26-phone-number-capture-dataset.md #108

Conversation

Anirudh257 commented Jul 26, 2022

janaab11 left a comment

Choose a reason for hiding this comment

janaab11 Aug 29, 2022

Choose a reason for hiding this comment

janaab11 Aug 29, 2022

Choose a reason for hiding this comment

janaab11 Aug 29, 2022

Choose a reason for hiding this comment

janaab11 Aug 29, 2022

Choose a reason for hiding this comment

janaab11 Aug 29, 2022

Choose a reason for hiding this comment

janaab11 Aug 29, 2022

Choose a reason for hiding this comment

janaab11 Aug 29, 2022

Choose a reason for hiding this comment

janaab11 Aug 29, 2022 • edited Loading

Choose a reason for hiding this comment

janaab11 Aug 29, 2022 •

edited

Loading