-
Notifications
You must be signed in to change notification settings - Fork 1
/
Copy pathdata_source_template.qmd
80 lines (64 loc) · 2.27 KB
/
data_source_template.qmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
## {Data Source}
```{r}
source(file.path(here::here(), "R/_load_pkgs.R"))
source(file.path(here::here(), "R/_quarto_helpers.R"))
source(file.path(here::here(), "R/_plotting_helpers.R"))
hookaddcap()
```
Introduction text
Data source description, type
- Quality rank (See @tbl-quality-rank)
- How, when, and why was the data collected?
- If this is a modeled dataset, what is the sample?
- What is the raw unit of measurement?
- How was this data accessed? Include any relevant links/citations, code, or downloads.
- What data cleaning or wrangling was completed? How did you test these processes and outputs?
- What is the geographic and temporal scope? Did you complete any aggregation?
- What version is the data? Were there other versions available? If so, why did you choose this version?
- What assumptions are made when we use this dataset?
- Which subject matter expert (SME) reviewed this data?
- Describe testing used to verify data
Be sure to add a citation of this dataset to the Zotero shared library.
### Data characteristics
- Were there any missing data? How did you handle missing data?
- Plots, tables, and description of data distribution
- Variance, Z-Score, quantiles
- Facet views by categorical variables
```{r fig-scatter}
#| fig-cap: "A scatter plot"
#| out-width: "95%"
#| include: false
shinipsum::random_ggplotly(type = "point") %>%
councilR::plotly_layout(
main_title = "Scatter"
)
```
```{r fig-boxplot}
#| fig-cap: "A boxplot"
#| out-width: "95%"
#| include: false
shinipsum::random_ggplotly(type = "boxplot") %>%
councilR::plotly_layout(
main_title = "Boxplot"
)
```
```{r fig-density}
#| fig-cap: "A density plot"
#| out-width: "95%"
#| include: false
shinipsum::random_ggplotly(type = "density") %>%
councilR::plotly_layout(
main_title = "Density"
)
```
### Limitations
### Comparison with similar datasets
### Data dictionary
Table with detailed description of columns and definitions for each data table.
```{r echo=FALSE}
caption_index <- readRDS(file.path(here::here(), "caption_index.RDS"))
caption_index <- as.character(as.numeric(caption_index) + 1) %>% stringr::str_pad(width = 2, side = "left", pad = "0")
saveCap(paste0("cap-", caption_index))
saveRDS(caption_index, file.path(here::here(), "caption_index.RDS"))
```
{{< pagebreak >}}