Crypto Breaking News

    Tether Releases QVAC Genesis II

    Expanding the World’s Largest Synthetic Educational Dataset to 148 Billion Tokens
    22 December 2025 (Updated: 22 December 2025)
    22 December, 2025 – Tether Data’s AI research division, QVAC, today announced the release of QVAC Genesis II, a major expansion of the world’s largest publicly available synthetic educational dataset for artificial intelligence pre-training. With the addition of 107 billion new tokens, the combined QVAC Genesis dataset now totals 148 billion tokens across 19 educational domains, significantly extending the scale, depth, and reasoning quality of open AI training data.
    QVAC Genesis II builds directly on the foundation laid by QVAC Genesis I, which introduced a rigorously validated, education-focused synthetic dataset spanning core STEM disciplines. This second release expands coverage to 10 new domains, including chemistry, computer science, statistics, machine learning, astronomy, geography, econometrics, and electrical engineering, while also regenerating college-level physics using an improved methodology. Together, Genesis I and II form the most comprehensive synthetic educational dataset ever released to the public.
    At the core of this release is a new data generation approach called Option-Level Reasoning, designed to extract structured reasoning not only from model failures, but also from correct answers. Rather than treating correct responses as finished outputs, this method systematically analyzes every answer option in a multiple-choice question, reinforcing correct reasoning while explicitly addressing common misconceptions. The result is training data that emphasizes clarity, causality, and decision-making, not just surface-level correctness.
    This new approach complements the original Failure Analysis method introduced in Genesis I, forming a dual-method pipeline that ensures every generated question contributes educational value. Independent evaluations show that models trained on Genesis II data demonstrate substantially higher reasoning accuracy and produce clear, unambiguous answers far more consistently than models trained on prior synthetic datasets.
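As a rough illustration of the dual-method idea described above, the sketch below routes a multiple-choice question to one of two prompt styles depending on whether a model answered it correctly. This is not QVAC's implementation: the class `MultipleChoiceQuestion`, the function `build_generation_prompt`, the method labels, and the prompt wording are all hypothetical names and text chosen for the example.

```python
from dataclasses import dataclass


@dataclass
class MultipleChoiceQuestion:
    """Hypothetical container for one multiple-choice item."""
    stem: str
    options: dict          # option label -> option text, e.g. {"A": "..."}
    correct_label: str


def build_generation_prompt(question, model_answer):
    """Return (method_name, prompt) for one question.

    Assumed routing rule, based only on the article's description:
    - correct answer -> option-level reasoning over every option
    - wrong answer   -> failure analysis of the chosen option
    """
    option_lines = "\n".join(
        f"{label}. {text}" for label, text in sorted(question.options.items())
    )
    header = f"Question: {question.stem}\n{option_lines}\n\n"

    if model_answer == question.correct_label:
        method = "option_level_reasoning"
        task = (
            "For each option, explain why it is correct or incorrect, "
            "and name the misconception behind each incorrect option. "
            f"The correct option is {question.correct_label}."
        )
    else:
        method = "failure_analysis"
        task = (
            f"A model chose {model_answer}, but the correct option is "
            f"{question.correct_label}. Explain the error in the reasoning "
            "that leads to the wrong choice, then give the correct reasoning."
        )
    return method, header + task


# Usage with an invented example question.
example = MultipleChoiceQuestion(
    stem="Which quantity is conserved in an elastic collision?",
    options={"A": "Only momentum", "B": "Momentum and kinetic energy"},
    correct_label="B",
)
method, prompt = build_generation_prompt(example, model_answer="B")
print(method)
print(prompt)
```

The point of the routing is that neither branch discards a question: a correct answer still yields an explanation of every option, and a wrong answer yields an analysis of the mistake.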
    More than a scale increase, this release reflects a deliberate shift in how educational AI data should be built. While much of the industry focuses on scraping and aggregating ever-larger volumes of text, QVAC’s approach is designed to teach models how to think, reason, and explain, grounding intelligence in understanding rather than imitation.
    “Most AI training today optimizes for fluency, not understanding,” said Paolo Ardoino, CEO of Tether. “With this release, we’re pushing beyond volume toward structure, reasoning, and clarity. Intelligence should be built on understanding why something is true, not just predicting what sounds right. By making this dataset open, we’re giving researchers and builders the tools to develop AI that is more reliable, more explainable, and ultimately more useful to society.”
    As with Genesis I, the expanded dataset is released openly to support researchers, academic institutions, and independent developers working outside of closed, proprietary systems. It is made available under a Creative Commons Attribution–NonCommercial (CC-BY-NC 4.0) license, reinforcing QVAC’s commitment to open, community-driven AI research.
    The release continues QVAC’s broader mission to advance local, decentralized intelligence, where AI models can be trained, refined, and deployed without dependence on centralized cloud platforms. By strengthening the open foundations of AI training data, Tether Data aims to reduce structural barriers to innovation and ensure that high-quality intelligence remains accessible to the global research community.
    The full technical breakdown of the dataset, titled “QVAC Genesis II: Expanding the Largest and Highest-Quality Multi-domain Educational Synthetic Dataset for Pre-training,” is available now on the QVAC research blog, alongside access to the dataset and models on Hugging Face.
    Further information, including a detailed FAQ section, is available on the QVAC website.

