Last updated 11 months ago
DBSync is the best source of raw on-chain data, but it is highly normalized and hard to analyze or apply machine learning in its raw state.
Our plan is to build an open-source analytics-ready dataset of Cardano on-chain data
This is the total amount allocated to Dataset - On-Chain Analytics.
Problem Summary
DBSync data is highly normalized and hard to analyze or apply Machine Learning in its raw state.
Our Solution
Our plan is to build an open-source analytics-ready dataset of Cardano on-chain data which provides the community with an analytics-ready dataset of the Cardano blockchain.
The core data source will be Cardano DB-Sync, and the deliverable will be a Github repository of SQL views which create datasets to enable developers and data scientists to answer questions such as:
Deliverables
The output of this project would be an open-source Github repository containing data integration scripts and SQL Queries as well as associated schema documentation.
Additionally, if the Cardano Data Hub idea ( https://cardano.ideascale.com/a/dtd/Cardano-Analytics-Data-Hub/368258-48088 ) gets funded, this dataset will be integrated as one of the available datasets there.
Project Plan and Budget
The project will have 4 components:
Data Discovery and community requirement gathering - $1,200
Data Modelling - $1,200
SQL Development - $1,575
Documentation - $600
Total Budget - $4,575
Budgets are based on a $75/h USD hourly rate. No funds are being requested for infrastructure costs or development tools.
Expected completion date would be 6-8 weeks after funding is received.
Core Team Experience
Michael Stewart
Vivek Nankissoor
NB: Monthly reporting was deprecated from January 2024 and replaced fully by the Milestones Program framework. Learn more here
Founders of Cardano Canucks and Canuckz NFT with 30+ yrs experience in data infrastructure, analysis and data visualization for enterprises.