-
Notifications
You must be signed in to change notification settings - Fork 16
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
* Add Airbnb Reviews Data * Update changelog.md * Lint * Subsample listing_id from data * Update Airbnb Reviews with Multiple Cities * Downsample further and update readme * Linting * Rounded up ratings
- Loading branch information
1 parent
c8a8893
commit 03b2d9c
Showing
6 changed files
with
68 additions
and
18 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -3,4 +3,5 @@ | |
load_covid_metadata, | ||
load_youtube, | ||
load_youtube_metadata, | ||
load_airbnb_reviews, | ||
) |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,26 @@ | ||
## Modified Airbnb Reviews Demo Dataset Attribution | ||
|
||
The demo dataset used in this package has been modified from its original version, which is licensed under the [Creative Commons Attribution 4.0 International License](https://creativecommons.org/licenses/by/4.0/). | ||
|
||
**Original Dataset Information:** | ||
|
||
- Titles: | ||
New York City, New York, United States - Detailed Review Data (05 June, 2023) | ||
London, England, United Kingdom - Detailed Review Data (05 June, 2023) | ||
Paris, Île-de-France, France - Detailed Review Data (05 June, 2023) | ||
San Francisco, California, United States - Detailed Review Data (05 June, 2023) | ||
- Author(s): Inside Airbnb | ||
- License: [Creative Commons Attribution 4.0 International License](https://creativecommons.org/licenses/by/4.0/) | ||
- Source: [Inside Airbnb Data](http://insideairbnb.com/get-the-data) | ||
|
||
**Modifications:** | ||
|
||
In this package, we have made the following modification to the original dataset: | ||
|
||
- Combined data from New York City, London, Paris, and San Francisco | ||
- Applied sentiment analysis to the comments column to derive rating scores for each entry. | ||
- Subsampled the listing_id to make for quicker analysis. | ||
|
||
Please note that while we have made modifications to the dataset, the original data is still covered under the CC BY 4.0 license. The modifications performed in this package do not change the license of the original dataset. | ||
|
||
The dataset is provided as-is, and the authors do not bear any responsibility for the usage or accuracy of the data or the sentiment analysis results. |
Binary file not shown.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters