Last updated 7 months ago
Plutus knowledge is scattered, hard to search, and often unreliable. Developers need a system that generates precise, citation-backed answers from curated Plutus resources.
Plutus RAG uses retrieval-augmented AI to deliver fast, accurate, citation-backed answers from curated Plutus docs, CIPs, and forums, helping developers build and audit smart contracts efficiently.
This is the total amount allocated to Wolfram Plutus RAG: The AI-Powered Knowledge Engine.
Please provide your proposal title
Wolfram Plutus RAG: The AI-Powered Knowledge Engine
Enter the amount of funding you are requesting in ADA
40000
Please specify how many months you expect your project to last
5
Please indicate if your proposal has been auto-translated
No
Original Language
en
What is the problem you want to solve?
Plutus knowledge is scattered, hard to search, and often unreliable. Developers need a system that generates precise, citation-backed answers from curated Plutus resources.
Supporting links
Does your project have any dependencies on other organizations, technical or otherwise?
No
Describe any dependencies or write 'No dependencies'
No dependencies
Will your project's outputs be fully open source?
Yes
License and Additional Information
We intend to license the project outputs under the Apache License 2.0, aligning with Cardano Foundation standards. Apache 2.0 chosen for patent protection and commercial compatibility while ensuring community can freely use, modify, and distribute. We believe this strikes a good balance between encouraging collaboration and protecting the project's integrity.
Please choose the most relevant theme and tag related to the outcomes of your proposal
AI
Mention your open source license and describe your open source license rationale.
We intend on utilizing Apache License 2.0, chosen specifically for its alignment with Cardano ecosystem standards and enterprise adoption requirements. This license provides explicit patent grants protecting contributors and users while allowing commercial use cases that ensure project sustainability.
How do you make sure your source code is accessible to the public from project start, and people are informed?
Code will be on a public GitHub from project start, with initial commits covering project details and documentation. We will post weekly updates in GitHub Discussions, Discord, and the Catalyst forum.
How will you provide high quality documentation?
This project will include a clear README, developer setup guides with screenshots and code snippets. The RAG pipeline will include examples and video walkthroughs to demonstrate the workflow from start to finish.
Please describe your proposed solution and how it addresses the problem
Plutus is the foundation of Cardano’s smart contract ecosystem, enabling secure, functional-programming-based decentralized applications. However, despite its capabilities, Plutus suffers from a documentation access problem:
Developers, auditors, and researchers are forced to manually gather information from scattered and inconsistent sources, slowing development and deterring new contributors.
The Plutus RAG project will build a Retrieval-Augmented Generation (RAG) AI system designed to be the authoritative technical knowledge engine for Plutus. It will unify and curate all relevant Plutus content into a single, searchable system and pair it with an AI assistant that provides accurate, context-aware, citation-backed answers.
To guarantee reliability, Plutus RAG will draw exclusively from vetted, authoritative materials, including:
We will aggregate and preprocess all Plutus-related documents, enriching them with metadata such as source, topic, and date to enable precise and efficient searching. This curated content will then be stored in a vector database, allowing for advanced semantic retrieval so queries can be matched by meaning, not just keywords.
When a user asks a question, the system will retrieve the most relevant context and pass it to a large language model, which will generate natural-language answers complete with in-line citations and links to original sources.
The backend will also expose REST APIs for seamless integration into IDEs, governance platforms, and other external workflows. Acting as an open-book reference for Plutus, this system will significantly improve accuracy, and strengthen smart contract security. Built with future expansion in mind, the RAG system will also support onboarding of additional data sources to keep pace with the evolving Cardano ecosystem.
Please define the positive impact your project will have on the wider Cardano community
Plutus RAG addresses a major challenge in the Cardano ecosystem by resolving the problem of scattered, inconsistent, and outdated information about Plutus. By consolidating official documentation, CIPs, community guides, and technical discussions into a single authoritative reference, it will make accurate, verified information instantly accessible, enabling faster, more secure smart contract development and auditing.
Expected Outcomes
What is your capability to deliver your project with high levels of trust and accountability? How do you intend to validate if your approach is feasible?
Wolfram’s competitive edge lies in the scale, precision, and maturity of its data curation and computational infrastructure, exemplified by the Wolfram|Alpha Computational Knowledge Engine and the Wolfram Knowledgebase, one of the largest curated computational datasets in the world. This infrastructure integrates thousands of authoritative sources, including government records, scientific databases, financial markets, and live data feeds.
The ingestion pipeline processes vast volumes of structured and unstructured content, applying rigorous normalization, consistency checks, and metadata enrichment to ensure data is always current and computation-ready. This capability, refined over decades, enables high-precision analytics, reasoning, and dynamic query generation across domains, predating and extending beyond the generative AI era.
Building on this expertise, Wolfram can create tailored data pipelines and knowledge architectures for Cardano’s Plutus. By integrating automated ingestion and enrichment with an LLM-powered retrieval-augmented generation (RAG) conversational layer, the system can enable semantic search and context-aware knowledge retrieval.
Organizational Strengths
Milestone Title
Plutus RAG Architecture, GitHub launch
Milestone Outputs
Core Plutus RAG system architecture defined and documented (data ingestion, vector DB, LLM integration, API layer). GitHub repository initialized with base code structure and Apache 2.0 license. Initial README with architecture diagrams and setup instructions.
Acceptance Criteria
Architecture will document and cover all core components and interfaces. The GitHub repository will be public with initial commits. A local environment setup will be verified using instructions in the README documentation.
Evidence of Completion
A gitHub repository link with initial commits and README file will be included. An architecture diagram will be included in the repo docs folder.
Delivery Month
1
Cost
5000
Progress
10 %
Milestone Title
Data Aggregation & Preprocessing Pipeline
Milestone Outputs
Aggregation of all identified Plutus-related sources (official docs, CIPs, GitHub repos, Cardano Foundation resources, forums, etc.). Preprocessing scripts for cleaning, normalizing, and adding metadata to documents. Vectorized dataset stored in preparation for vector database ingestion.
Acceptance Criteria
All listed sources are retrieved and stored in a raw AND processed formats in order to make both convenient use and also fact checking.
Scripts will be run successfully and generate searchable datasets for the verifier.
Evidence of Completion
GitHub repository updated with preprocessing scripts and metadata schema.
Dataset sample files with documented fields.
Delivery Month
2
Cost
10000
Progress
40 %
Milestone Title
Semantic Retrieval Engine & Vector Database Setup
Milestone Outputs
A vector database will be configured for semantic search on Plutus documents.
An embedding pipeline will be integrated for document indexing and retrieval.
And search API endpoints for semantic queries will be implemented.
Acceptance Criteria
A vector database will be operational with the ingested processed dataset.
Queries will return relevant ranked results for efficient verification.
And a search API will be documented with example requests and responses.
Evidence of Completion
GitHub repository updated with vector database configuration and search API source code. Demo video showing semantic search on indexed Plutus content.
Delivery Month
3
Cost
10000
Progress
60 %
Milestone Title
LLM Integration & Answer Generation
Milestone Outputs
A RAG pipeline integrated with a selected LLM to generate natural language answers. An in-line citation system will link back to original the sources in order to provide a clear understanding of what informed the answers.
Acceptance Criteria
The AI assistant via RAG returns context-aware answers which will support the desired output. A demonstration of the working system should show that citations consistently map to the correct document and section.
Evidence of Completion
GitHub updated with the with RAG + LLM integration modules. Sample Q&A logs with source references will be provided.
Delivery Month
4
Cost
10000
Progress
80 %
Milestone Title
REST API Integration, Documentation, Community Testing, Final Release
Milestone Outputs
REST API for external integration with IDEs, governance platforms, and workflows. Developer and user documentation. Video tutorials and examples. Community testing feedback incorporated into the final release.
Acceptance Criteria
A REST API passes acceptance testing forr automated and manual functional tests with adequate documentation able to be verified by Cardano community. REST API for external integration with IDEs, governance platforms, and workflows.
Evidence of Completion
GitHub repository containing final API code, documentation, and developer tools. Postman collection for API endpoints.
Delivery Month
5
Cost
5000
Progress
100 %
Please provide a cost breakdown of the proposed work and resources
The total project budget is ₳40,000, allocated to ensure each development stage is fully resourced, tied to measurable outputs, and aligned with the 5-month delivery plan.
Plutus RAG Architecture & GitHub Launch – ₳5,000
Funds the definition of the core system architecture, environment setup, and the creation of a public GitHub repository licensed under Apache 2.0. Includes architecture diagrams, initial README documentation.
Data Aggregation & Preprocessing Pipeline – ₳10,000
Covers the aggregation of authoritative Plutus sources (official docs, CIPs, GitHub repos, Cardano Foundation resources, and community discussions), along with preprocessing scripts for cleaning, normalizing, and adding metadata. Outputs a vector-ready dataset for ingestion into the semantic search engine.
Semantic Retrieval Engine & Vector Database Setup – ₳10,000
Funds the configuration of the vector database and the integration of the embedding pipeline for semantic search. Includes API endpoints for ranked, meaning-based document queries, with example requests/responses and a working search demo.
LLM Integration & Answer Generation – ₳10,000
Supports the integration of a retrieval-augmented generation pipeline with a large language model to produce context-aware, citation-backed answers. Includes an in-line citation framework and testing to ensure accuracy and source traceability.
REST API Integration, Documentation & Final Release – ₳5,000
Enables the delivery of the final REST API for integration with IDEs, governance platforms, and external workflows. Funds developer and user documentation, video tutorials, community testing, and incorporation of feedback into the final release.
How does the cost of the project represent value for the Cardano ecosystem?
The Plutus RAG initiative delivers exceptional value by combining Wolfram’s proven expertise in large-scale knowledge curation, retrieval-augmented generation (RAG), and computational reasoning with targeted, high-impact outcomes for the Cardano ecosystem.
The requested budget for Plutus RAG delivers both immediate and lasting value to the Cardano ecosystem:
By aligning cost with direct, high-impact benefits, Plutus RAG not only addresses an immediate developer need but also establishes a scalable foundation for ongoing innovation and ecosystem expansion.
Terms and Conditions:
Yes
Jon Woodard, CEO
Jon Woodard is the CEO at Wolfram Blockchain Labs, where Jon coordinates the decentralized projects that connect the Wolfram Technology ecosystem to different DLT ecosystems. Previously at Wolfram Research Jon worked on projects at the direction of Wolfram Research CEO Stephen Wolfram and prior to that was a member of the team who worked on the monetization strategies and execution for Wolfram|Alpha. Jon has a background in economics and computational neuroscience. He enjoys cycling in his spare time.
Steph Macurdy, Head of Research and Education
Steph Macurdy has a background in economics, with a focus on complex systems. He attended the Real World Risk Institute in 2019, lead by Nassim Taleb, and has been investing in the crypto asset space since 2015. He previously worked for Tesla as an energy advisor and Cambridge Associates as an investment analyst. Steph is a youth soccer coach in the Philadelphia area and is interested in permaculture.
Gaurav Vishal, Manager
Gaurav Vishal is a Manager in Wolfram Research’s Technical Consulting team with over 6 years of experience in designing and delivering computational applications and enterprise AI solutions. He specializes in full-stack development, data analysis, machine learning, with expertise in microservices architecture and distributed data systems. Since joining Wolfram in 2019, he has led projects ranging from AI-powered tutoring platforms and large-scale data integration pipelines to secure on-premise analytics systems for enterprise and government clients. Gaurav holds a B.Tech degree from IIT Bhubaneswar, where he earned the Institute’s Silver Medal for academic excellence. He also received the TSIL Research Partner Award for his research on heat transfer in coal-fired Sponge Iron Rotary Kilns. Known for his focus on computational efficiency, system reliability, and client satisfaction, Gaurav has contributed to high-impact technical solutions for Fortune 500 companies, academic institutions, and public sector organizations.
Soumya Ranjan Dhal, Application Developer
Soumya Ranjan Dhal is an Application Developer at Wolfram Technical Consulting, where he has delivered both backend and frontend solutions for complex applications. He has developed backend systems in C++ and frontend components in Swift, along with building scalable web applications using modern frameworks. Soumya has hands-on experience with AI/ML, large language models, and Retrieval-Augmented Generation systems. He holds a Bachelor’s degree in Computer Science and Engineering and applies his expertise to create innovative, data-driven solutions in education, image processing, and other domains.
Rahul Kar, Quality Assurance
Rahul Kar works as a QA Engineer at Wolfram Research and enjoys ‘breaking’ software so it won’t break for end users. He automates repetitive regression testing with Playwright, TestNG, and Java, while adding a manual touch through exploratory testing to uncover hidden bugs. Previously, as a QA Executive at Vodafone, he worked with Selenium-based automation in telecom domain to ensure product quality. Outside of work, he enjoys reading comics.
Gabriela Guerra Galan, Project Manager
Gabriela Guerra Galan: Gabriela has 15+ years of experience leading projects. She is a certified PMP and Product Owner with bachelor's degree in Mechatronical Engineering, complemented by a master's degree in Automotive Engineering. As the co-founder of Bloinx, a startup that secured funding from the UNICEF Innovation Fund, she has demonstrated a passion for driving innovation and social impact.