Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Pivot widget [was: Confusion Matrix for ordinary data] #2860

Closed
astaric opened this issue Jan 10, 2018 · 5 comments
Closed

Pivot widget [was: Confusion Matrix for ordinary data] #2860

astaric opened this issue Jan 10, 2018 · 5 comments
Assignees

Comments

@astaric
Copy link
Member

astaric commented Jan 10, 2018

I want to compare cluster assignments to the ground truth (a column in original data that was not used to do the clustering).

I can do that in Box Plot, but there is currently no way to show this in a Confusion Matrix.

@kernc
Copy link
Contributor

kernc commented Jan 10, 2018

For "ordinary data", this sounds like a Pivot Table.

@astaric
Copy link
Member Author

astaric commented Jan 10, 2018

That link on Pivot Table is not working, where can I find one? :)

@janezd janezd added the needs discussion Core developers need to discuss the issue label Jan 25, 2019
@janezd janezd assigned BlazZupan and janezd and unassigned BlazZupan and janezd Jan 31, 2019
@janezd janezd removed the needs discussion Core developers need to discuss the issue label Feb 1, 2019
@janezd
Copy link
Contributor

janezd commented Feb 1, 2019

We need to see Pivot in Excel and in Orange 2, design GUI, discuss it, implement it.

@janezd janezd changed the title Confusion Matrix for ordinary data Pivot widget [was: Confusion Matrix for ordinary data] Feb 1, 2019
@ajdapretnar
Copy link
Contributor

ajdapretnar commented Apr 11, 2019

I did a little research. There are two things that pivots can display - contingency tables and aggregates.

Orange 2

The old Orange 2 showed a version of contingency table. The user can select a discrete attribute, whose values are rows, and a discrete attribute, whose values are column names. The third attribute is used as the value of cells and can be either discrete or continuous (different aggregations apply).
Screen Shot 2019-04-11 at 13 20 58

Orange 3

The new Orange 3 already has Contingency table in Prototypes. It shows counts for discrete attributes (no third attribute, only the size of subgroups).
Screen Shot 2019-04-11 at 13 26 32

Excel

Pivot in Excel is a bit different (much more complicated). In a simple form, they allow you to use any kind of attribute for rows (even continuous). Then the user can select which attributes she wants to show in the pivot table. We are not talking about a values of a single discrete attribute, but aggregates of attributes in a data table. Each attribute can be given a specific kind of aggregation.
Screen Shot 2019-04-11 at 13 25 32

The fun part is that you can easily create contingency table by selecting two discrete attributes (one for row, one for column) and duplicate one of the two to values. Really neat.
Screen Shot 2019-04-11 at 13 46 35

I think it makes no sense to try to be like Excel, but we can still have a powerful widget with a simple interface if done right. Perhaps one option is to have a contingency table when only one discrete attribute is selected as a column.

@janezd
Copy link
Contributor

janezd commented Jun 1, 2019

Implemented via #3823

@janezd janezd closed this as completed Jun 1, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

6 participants