HackathonAI / NLPFact-CheckingRocketHacks 2025Best Use of MongoDB Atlas

DeepTruth

AI-powered fact-checking for a noisy internet.

A Rocket Hacks project combining Gemini AI, DistilBERT, Django REST, React, MongoDB, and a Chrome extension to help users evaluate online claims before they spread.

Best Use of MongoDB Atlas

24h

Build sprint

70/30

Gemini / DistilBERT weighting

Interfaces: web app + extension

9.5K

Training data points

Dual

AI model fusion pipeline

01 — Overview

Because in a world full of noise, clarity is power

DeepTruth is an AI-powered fact-checking platform built for RocketHacks 2025. The project was designed to help users evaluate online claims, article titles, and suspicious headlines before misinformation has a chance to spread further.

The platform combines a web app, Chrome extension, AI reasoning, NLP classification, and source verification into one workflow. Users can enter an article title or analyze content while browsing, then receive a credibility score, veracity assessment, reasoning, confidence signal, and independent sources for cross-checking.

The goal was never to replace human judgment. The goal was to give users a faster first-pass credibility tool — one that helps them think critically before trusting or sharing information. DeepTruth won Best Use of MongoDB Atlas at RocketHacks 2025.

02 — Problem

Misinformation spreads because verification is slow

The internet makes information easy to access but difficult to trust. Misleading headlines, AI-generated content, and low-quality sources spread quickly because users often see claims without context, source comparison, or evidence-based verification. Most people do not have time to manually investigate every article they encounter.

Misleading titles

Headlines are designed to drive clicks, not accuracy. Framing alone can bias how a claim is understood.

No quick tools

Fact-checking sites exist but require manual navigation — creating friction that most users skip.

AI-generated content

Synthetic text can look credible and pass casual inspection, but carries no ground-truth accountability.

Source opacity

Users rarely know who is behind a claim or whether the source has a history of corrections.

Sharing before checking

Social media rewards speed — content spreads before it is verified, and retractions rarely catch up.

Single-source trust

Readers often rely on one outlet rather than cross-referencing multiple independent sources.

DeepTruth addresses this by turning fact-checking into a faster, AI-assisted workflow — reducing the friction between encountering a claim and getting credibility context for it.

03 — Solution

Two interfaces, one credibility pipeline

DeepTruth provides two primary user interfaces that feed into the same backend analysis pipeline.

Web App

Users input article titles or claims directly
Returns credibility score, veracity assessment, reasoning, and independent source links
Designed for direct claim verification before sharing

Chrome Extension

Brings credibility analysis directly into the browsing workflow
No need to leave the article page to begin verification
Meets users where misinformation actually appears

Both interfaces connect to the same Django REST API, which orchestrates Gemini AI reasoning, DistilBERT classification, and source verification before returning a structured result to the user.

API endpoints

POST

/api/verify-claim/

Analyze an article title or claim — returns credibility, veracity, reasoning, and source links.

GET

/api/false-news/

Retrieve previously analyzed or flagged claims stored in MongoDB.

04 — AI Approach

Model fusion over single-signal trust

The core technical decision behind DeepTruth is model fusion: combining two complementary AI systems rather than relying on one signal. A single model is both overconfident and undercalibrated for the complexity of credibility assessment.

Weighted fusion: 70% Gemini / 30% DistilBERT

70% — Gemini AI

30% — DistilBERT

More weight goes to Gemini because reasoning and explanation quality are primary user-facing values. DistilBERT adds a specialized classification signal without dominating the output.

Gemini AI — 70%

Google Gemini Pro handles high-level reasoning, claim interpretation, and explanation generation. It produces human-readable analysis of why a claim may be credible or suspicious — not just a score, but the reasoning behind it.

DistilBERT — 30%

A fine-tuned DistilBERT model provides NLP-based veracity classification. Trained on approximately 9,500 data points, it adds a language-model signal specialized for misinformation patterns that complements Gemini's broader reasoning.

Why fusion matters

A single model produces overconfident outputs. By combining Gemini's reasoning with DistilBERT's classification, DeepTruth surfaces more calibrated credibility signals — and makes the system more honest about what it knows and does not know.

05 — Architecture

Full-stack from browser to model inference

The system flows from user input through two interface layers, a REST API backend, AI inference, MongoDB storage, and back to a structured result.

User input → React app or Chrome extension → Django REST API → Gemini + DistilBERT → MongoDB → credibility result + reasoning + sources

01

Frontend — Web App

ReactViteHTML / CSS / JSClaim input formCredibility score displayResult explanation UI

02

Frontend — Chrome Extension

HTMLCSSJavaScriptChrome Extension APIBrowser-integrated analysis

03

Backend

DjangoDjango REST FrameworkClaim verification endpointModel orchestrationSource verification workflow

04

AI / NLP

Google Gemini ProFine-tuned DistilBERT70/30 weighted fusionVeracity classificationCredibility scoring

05

Database

MongoDBAnalyzed claim storageFalse-news history retrievalFlexible document schema

06 — Technical Stack

What DeepTruth runs on

FE

Frontend

ReactViteHTMLCSSJavaScript

BE

Backend

DjangoDjango REST FrameworkPython

AI

AI / ML

Google Gemini ProDistilBERTHugging FaceNLP classificationWeighted model fusion

DB

Database

MongoDB Atlas

EXT

Browser

Chrome Extension API

OPS

Tooling

GitHubNode.jsPython virtual environmentEnvironment variablesMongoDB connection config

ReactViteDjangoDjango REST FrameworkPythonGoogle Gemini ProDistilBERTHugging FaceMongoDBChrome ExtensionNLPHackathon

07 — Lessons Learned

What a 24-hour build teaches you

Building DeepTruth under hackathon constraints produced sharper design decisions than a longer timeline might have allowed. A few lessons that stuck:

Credibility systems need explanations

A score without reasoning is just noise. Users need to understand why the model flagged something before they can decide what to do with that signal.

Multi-model architectures are more honest

Trusting a single model produces overconfident outputs. Fusing two systems forces the architecture to be explicit about uncertainty and weighting choices.

Browser extensions are the right surface

Misinformation appears during browsing. A standalone fact-checker that requires a separate tab creates friction — an extension removes it.

Backend integration is harder than the demo

Connecting DistilBERT inference, Gemini API calls, MongoDB writes, and Chrome extension communication simultaneously under time pressure revealed integration debt quickly.

Hackathon projects need a sharp product story

Working code is necessary but not sufficient. The ability to communicate the problem, the audience, and the design tradeoffs clearly was as important as the implementation.

AI should assist, not replace, judgment

Framing matters. A tool that claims to decide truth is both overreaching and less credible than one that claims to help users think more carefully.

Responsible framing

DeepTruth should be read as an AI-assisted credibility platform, not an absolute truth detector.

“DeepTruth provides credibility signals, reasoning, and independent sources to support critical thinking.”

08 — Future Work

Where DeepTruth goes next

The hackathon prototype establishes a working pipeline. Future versions could transform it into a much more powerful misinformation-detection platform.

Near-Term

RAG pipeline for evidence retrieval from trusted sources before generating explanation
Source credibility database with reliability history, bias indicators, and correction patterns
Explainability dashboard showing which tokens influenced the verdict

Mid-Term

Deepfake detection for images, video, and audio
Reinforcement learning feedback loop using user ratings to improve scoring
Claim clustering to detect repeated misinformation narratives across articles

Long-Term

Browser-level non-intrusive warnings for highly suspicious headlines
Multi-language support for international misinformation detection
Model improvement pipeline using MongoDB-stored claim history as training feedback

The goal is not to replace human judgment. The goal is to give people faster credibility signals, clearer reasoning, and independent sources so they can think critically.

In a world full of noise, DeepTruth helps users slow down before they believe or share.

← Back to Projects GitHub ↗Watch Demo ↗