The center of the Datonic community. A place to improve the way the world produces, share, consume and collaborate on open datasets.
Aiming to share open data generated with open source software using open protocols running on open infrastructure.
These are some of the principles to keep in mind when thinking about Datonic projects.
- FAIR.
- Minimal (KISS) and flexible. Rely on tools that do one thing and do it well.
- No vendor lock-in.
- Rely on Open code, standards, and infrastructure.
- Use the tool you want to create, explore, and consume the datasets. Agnostic of any tooling or infrastructure provider.
- Standard format for data and APIs! Keep your data as future-friendly and future-proof as possible! E.g: publish data as Parquet files on S3.
- Modular ecosystem. Rely on standards like Arrow for IPC, and Frictionless for metadata.
- Distributed Permissionless ecosystem and collaboration. Open source code and make it ready to be improved.
- Data marketplace not linked to a specific company or platform.
- Declarative and reproducible workflows and datasets. Everything as code.
- Open and transparent governance and funding.
- Community that incentives contributors.
- Embrace inmutability and idempotency. Rely on content-addressable storage and append-only logs.
- Aim for stateless and serverless deployments. E.g. use GitHub Pages, host datasets on S3.
- Offline-first. Rely on static files and offline-first tools like Datasette.
- Aim for stateless and serverless deployments. E.g. use GitHub Pages, host datasets on S3. No servers to maintain, no databases to manage, no infrastructure to worry about. Keep infrastructure management lean.
- Above all, have fun and ejoy the process. π
- Reuse existing tools.
- Experiment mindset. Encourage trying new things (tools, processes, frameworks, ...).