Improvements arising from suggestions evt #132

eugenividal · 2025-01-14T21:21:17Z

@e-kotov and @Robinlovelace, in this PR I've:

In the README:
Rephrased the paragraph explaining the different versions of the data based on the improvements on the paper.
In the vignettes:
Renamed the mobility datasets as follows:
+ Version 1, Mobility data:
- Origin-Destination data
- Population by trip count data
+ Version 2, Mobility data:
- Origin-Destination data
- Population by trip count data
- Population by overnight stay data

These were the main tasks from #129. As for the rest, I found some issues:

In the README:
Given that the title is done with one #, add one level (i.e. ##) to the headings 1. I've tried this but there are several qmd files linked to the README, so to make it work we would have to lower one level in all the files, or we can just forget about this very minor issue. It is just aesthetics
plot the desire lines using the flow mapper.
Not sure where this plot comes from
Remove the title of the first figure --- there is already a caption with title for this figure.
Same as above
Beautify Figure 4, checking for typos (e.g. munucipality, houlry), putting the first three boxes at the same level, etc.
Same as above
In the package:

These suggestions regarding function names were also very minor. If you agree, better if you implement them @e-kotov. I am not sure how to rename functions.

Fix inconsistencies in the name of the variables of the different versions: e.g. date and full_date, and residence_province_name and residence_province.
I'd consider renaming some variables: E.g. time slot = hour_slot and distance = interval_distance

Robinlovelace

👍 from me, thoughts @e-kotov ?

e-kotov · 2025-01-14T23:08:39Z

thoughts

@Robinlovelace Wondering why ubuntu devel suddenly fails, though @eugenividal did not touch the code ) Probably has something to do with ubuntu latest moving to v24 in github actions... Have not studied the logs yet.

@eugenividal are you hesitating to touch the code and the docs?

Some of the changes in the vignette are consistent with the package docs and argument options.

'Population by trip count data' - I'm not sure this reads well. In the package docs we refer to it as "number_of_trips" (see help for spod_get, spod_download, spod_convert). So if we change it in vignette, we need to make it consistent with the package options and docs. But the package is already released. Not that many people have downloaded it, but I would be hesitant to suddenly change the arguments and make breaking changes. So at this point I am hesitant to change the vignette in this way, especially if the new heading does not read well (to me at least it does not). One of the problems is this specific data set that has this weird form/grouping of number or people by number of trips as a factor... Could we try alternative headings for this one that would be easy to understand but also consistent with the docs?

eugenividal · 2025-01-15T08:55:08Z

Hi @e-kotov, yeah, not sure what happened with this PR.
The most important change is the definition of the data versions in the README, which is implemented in the files changed. This part is now correct—previously, it was misleading regarding the differences between versions.
As for the name of the current 2.2. 'number of trips' data in the vignettes, I find it a bit confusing. Let me explain this. For instance, in version 1, the first dataset (maestra1) provides the number of trips (and trip kilometers) per origin and destination, we call this origin destination data (which seems ok). In contrast, the second dataset (maestra2) gives the number of people by trip count category (0, 1, 2, 2+), we call this number of trips data. Referring to maestra2 as 'number of trips' can be a little misleading since it actually represents the number of individuals per trip frequency, while maestra1 actually provides the number of trips by origin and destination. Do you see what I mean? That said, it’s probably not incorrect—just a bit ambiguous. If it works for you, we can leave it as is. It may be just my own interpretation. Do you find the title of these headings (2.1. origin-destination data and 2.2. number of trips data) in the vignettes correspond well with the tables they show, @Robinlovelace?
Regarding the rest of suggestions, they’re mostly aesthetic and subjective. I didn’t make any changes yet because I couldn’t find the original code for the figures yesterday. Let me know if you’d like me to try to implement these changes another day if you think the suggestions could be useful.

e-kotov · 2025-01-15T10:32:39Z

@eugenividal I checked the main branch and the tests are failing for the main as well, the reason is some bug that I will attend later, it's not critical.

This 'number of trips' is super confusing and hard to explain, but I think we should try )

I would ask for some patience and allow me some more time (about a week from now, as I'm a bit busy right now) to add to/review your changes as they may require changes in the package docs and I want it to be a cohesive thing before merging, I don't want to push to the main something that will be immediately reflected on the package website and may not represent what the CRAN version of the package does and says in the docs.

eugenividal · 2025-01-15T12:18:41Z

@e-kotov, absolutely, no rush and agree with your comments, better not to change anything unless it is a clear and worth it improvement! The package is already great. These are just minor suggestions that we can implement only if they make it better and do not represent big changes.

I agree, the 'number of trips' is quite confusing! :) But we'll find a way to clarify it. The dataset is really complex, and sometimes complexity can't be easily simplified.

I just did this experiment: I asked ChatGPT to suggest titles for the maestra 1 and 2 tables of version 1, but I think it struggled too. Here are the responses:

maestra 1: Travel Activity Data: Origins, Destinations, and Trip Details by Residence and Age Group
maestra 2: Number of Trips and People by District and Trip Category

I wouldn't say maestra 2 provides the 'Number of Trips and People by District and Trip Category', but only the 'Number People by District and Trip Category'.

eugenividal added 3 commits January 14, 2025 20:55

rephrase paragraph and add level

a96ab55

rename mobility data v1

31b0bb2

rename mobility datasets v2

59a0ddf

eugenividal requested review from e-kotov and Robinlovelace and removed request for e-kotov January 14, 2025 21:21

Robinlovelace approved these changes Jan 14, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improvements arising from suggestions evt #132

Improvements arising from suggestions evt #132

eugenividal commented Jan 14, 2025 •

edited

Loading

Robinlovelace left a comment

e-kotov commented Jan 14, 2025

eugenividal commented Jan 15, 2025 •

edited

Loading

e-kotov commented Jan 15, 2025

eugenividal commented Jan 15, 2025 •

edited

Loading

Improvements arising from suggestions evt #132

Are you sure you want to change the base?

Improvements arising from suggestions evt #132

Conversation

eugenividal commented Jan 14, 2025 • edited Loading

Robinlovelace left a comment

Choose a reason for hiding this comment

e-kotov commented Jan 14, 2025

eugenividal commented Jan 15, 2025 • edited Loading

e-kotov commented Jan 15, 2025

eugenividal commented Jan 15, 2025 • edited Loading

eugenividal commented Jan 14, 2025 •

edited

Loading

eugenividal commented Jan 15, 2025 •

edited

Loading

eugenividal commented Jan 15, 2025 •

edited

Loading