ai safety machine learning natural language processing synthetic data
A comprehensive dataset designed for aligning language models with safety and ethical guidelines. Contains 8,361 curated triplets of prompts, responses, and safe responses across various risk categories. Each entry includes safety scores, judge reasoning, and harm probability assessments, making it valuable for model alignment, testing, and benchmarking.
Static dataset, version 1.0 (Released December 2024)
Apache License 2.0 (https://www.apache.org/licenses/LICENSE-2.0)
https://huggingface.co/datasets/gretelai/gretel-safety-alignment-en-v1
See all datasets managed by Gretel.ai.
Gretel Synthetic Safety Alignment Dataset was accessed on DATE
from https://registry.opendata.aws/gretel-synthetic-safety-alignment-en-v1. @dataset{gretelai_gretel-safety-alignment-en-v1, title = {Gretel Synthetic Safety Alignment Dataset}, year = {2024}, month = {12}, publisher = {Gretel}, url = {https://huggingface.co/datasets/gretelai/gretel-safety-alignment-en-v1}}
arn:aws:s3:::gretel-datasets-public/gretel-synthetic-safety-alignment-en-v1
us-west-2
aws s3 ls --no-sign-request s3://gretel-datasets-public/gretel-synthetic-safety-alignment-en-v1/