Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
AI Safety Research's picture
29 46 314

AI Safety Research

AISafety
dvilasuero's profile picture CYGDEN's profile picture shtefcs's profile picture
ยท
https://humanaligned.ai

AI & ML interests

LLMs, planning, EA

Recent Activity

new activity about 11 hours ago
JetBrains/Mellum2-12B-A2.5B-Thinking:Vllm usage guide
liked a model about 11 hours ago
JetBrains/Mellum2-12B-A2.5B-Instruct
liked a model about 11 hours ago
JetBrains/Mellum2-12B-A2.5B-Thinking
View all activity

Organizations

Hugging Face Discord Community's profile picture ML intern explorers's profile picture ML Floppers's profile picture

Collections 3

Model building
  • Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

    Paper โ€ข 2511.06221 โ€ข Published Nov 9, 2025 โ€ข 135
Safety and transparency
  • OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens

    Paper โ€ข 2504.07096 โ€ข Published Apr 9, 2025 โ€ข 77
Model building
  • Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

    Paper โ€ข 2511.06221 โ€ข Published Nov 9, 2025 โ€ข 135
Safety and transparency
  • OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens

    Paper โ€ข 2504.07096 โ€ข Published Apr 9, 2025 โ€ข 77
View 3 collections

spaces 1

Runtime error
Agents

Magicoder

๐Ÿจ

Dec 5, 2023

models 0

None public yet

datasets 0

None public yet
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs