A curated dataset of 500 websites annotated for the accuracy of their content (labels: Correct, Incorrect, Partially Correct), plus simple visualization and evaluation tooling.
## What's included
- `website_dataset.json`: the dataset (each entry includes `id`, `url`, `topic`, `label`, and `reasoning`).
- `visualize.html`: lightweight interactive charts for exploring label and topic distributions.
- `eval.py`: example evaluation harness that queries a web-enabled LLM and saves outputs to `model_behavior_outputs.json`.
- `model_behavior_outputs.json`: example outputs from a model evaluation run.
Each item in `website_dataset.json` contains:
- `id` (string): unique identifier
- `url` (string): source URL
- `topic` (string): subject/domain
- `label` (string): one of `Correct`, `Incorrect`, `Partially Correct`
- `reasoning` (string): human explanation for the assigned label
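As a quick sanity check, the dataset can be loaded and its label and topic distributions summarized in a few lines of Python. This sketch assumes `website_dataset.json` is a single JSON array of objects with the fields above:

```python
# Load the dataset and summarize it; assumes a top-level JSON array of entries.
import json
from collections import Counter

with open("website_dataset.json", "r", encoding="utf-8") as f:
    dataset = json.load(f)

print(f"{len(dataset)} entries")                                            # expected: 500
print("Labels:", Counter(item["label"] for item in dataset))
print("Top topics:", Counter(item["topic"] for item in dataset).most_common(5))
```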
The easiest way to view the charts is to serve the repository and open `visualize.html` in a browser (most browsers block local file access to JSON):
```bash
python -m http.server 8000
# then open http://localhost:8000/visualize.html
```

`eval.py` is an example script that:
- loads `website_dataset.json`
- constructs prompts for each item
- calls an LLM (via NVIDIA/Tavily tool bindings) to fetch web evidence and generate a response
- writes results to `model_behavior_outputs.json`
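For orientation, the sketch below shows one way such a loop could be structured with the `langchain_nvidia_ai_endpoints` and `langchain_tavily` bindings. It is not a copy of `eval.py`: the model name, prompt wording, search query, and output fields are illustrative assumptions, and for simplicity it calls the search tool directly and pastes the results into the prompt rather than binding the tool to the model.

```python
# Illustrative evaluation loop (NOT the exact contents of eval.py).
# Assumes NVIDIA_API_KEY and TAVILY_API_KEY are set in the environment.
import json

from langchain_nvidia_ai_endpoints import ChatNVIDIA  # NVIDIA-hosted chat models
from langchain_tavily import TavilySearch             # Tavily web search tool

llm = ChatNVIDIA(model="meta/llama-3.1-70b-instruct")  # model name is an assumption
search = TavilySearch(max_results=5)

with open("website_dataset.json", "r", encoding="utf-8") as f:
    dataset = json.load(f)

outputs = []
for item in dataset:
    # Gather web evidence about the page, then ask the model to judge its accuracy.
    evidence = search.invoke({"query": f"{item['topic']} {item['url']}"})
    prompt = (
        f"Assess the factual accuracy of the content at {item['url']} "
        f"(topic: {item['topic']}). Answer with Correct, Incorrect, or "
        f"Partially Correct and a short justification.\n\n"
        f"Web evidence:\n{json.dumps(evidence, default=str)}"
    )
    response = llm.invoke(prompt)
    outputs.append({"id": item["id"], "model_response": response.content})

with open("model_behavior_outputs.json", "w", encoding="utf-8") as f:
    json.dump(outputs, f, indent=2)
```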
Prerequisites and notes:
- The script expects two API keys as environment variables: `NVIDIA_API_KEY` and `TAVILY_API_KEY`.
- Install the required Python packages before running (`eval.py` uses the `langchain_core`, `langchain_nvidia_ai_endpoints`, and `langchain_tavily` bindings).
- Run the script with `python eval.py`.

Outputs from a run are saved to `model_behavior_outputs.json` in the repository root.
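For reference, a typical setup and invocation might look like this (the exported key values are placeholders; the package names correspond to the imports listed above):

```bash
pip install langchain-core langchain-nvidia-ai-endpoints langchain-tavily
export NVIDIA_API_KEY="your-nvidia-key"    # placeholder
export TAVILY_API_KEY="your-tavily-key"    # placeholder
python eval.py
```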
This work is licensed under the MIT License.