Blog-explorers

AI & ML interests

None defined yet.

Recent Activity

d3bach authored a paper 8 days ago

MARQUIS: A Three-Stage Pipeline for Video Retrieval-Augmented Generation

d3bach authored a paper 8 days ago

Principled Context Engineering for RAG: Statistical Guarantees via Conformal Prediction

juiceb0xc0de new activity 9 days ago

blog-explorers/README:Pending Access Request to Join HF Blog Explorers

dippatel1994

posted an update 4 days ago

Post

997

To make revising LLM architectures and training methods faster, I created a deck of 180 visual flashcards. It started as a personal hobby, but slowly became a cheat code for reviewing LLM concepts before technical interviews. People love it!

Swipe through these samples, and if you want to grab the full set or follow the project, the repo is here: https://github.com/llmsresearch/llm-flashcards.

juiceb0xc0de

in blog-explorers/README 9 days ago

Pending Access Request to Join HF Blog Explorers

#17 opened about 2 months ago by

AINovice2005

satpalsr

submitted a paper to Daily Papers 23 days ago

MobileEgo Anywhere: Open Infrastructure for long horizon egocentric data on commodity hardware

Paper • 2605.05945 • Published May 7 • 10

satpalsr

authored a paper 23 days ago

MobileEgo Anywhere: Open Infrastructure for long horizon egocentric data on commodity hardware

Paper • 2605.05945 • Published May 7 • 10

bshepp

posted an update 23 days ago

Post

173

A dead 2013 Butterfly Labs "Jalapeno" SHA-256 mining ASIC sat in a drawer for a decade. It became the excuse for a small, careful question: how much structure can a tiny, cheap model learn in SHA-256, and how would I know if I were fooling myself? (The ML runs on CPU and a HF job, not the ASIC; the dead miner is just the origin story.)

Three findings, written up honestly:

1. A sharp round-4 cliff. Round-reduced SHA-256 is ~100% distinguishable through 3 rounds, then collapses to chance at round 4 and stays there out to the full 64. Reproduced across 5 seeds.

2. A controls-gated bounded null on full SHA-256: no learnable structure above a ~0.22% resolution floor at n=4,000,000. That is a bounded null at this budget, not a claim that SHA-256 is random.

3. A "signal" in the iterated-hash dynamics that a permuted-label control unmasked as a label-prior artifact. The instrument caught its own false positive. That was the point of building the controls.

Negative results, stated with their resolution. The dataset carries the controls on every row.
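
The permuted-label control in finding 3 can be illustrated with a minimal sketch. It is not the code from the linked repository: the data, the 60/40 label imbalance, and the helper names `majority_class_accuracy` and `permuted_label_control` are all assumptions made for illustration. The idea is that shuffling the labels destroys any input-label relationship while preserving the label prior, so a score that survives the shuffle cannot be coming from the inputs.

```python
import random


def majority_class_accuracy(train_labels, test_labels):
    """Accuracy of a model that ignores its inputs and always predicts
    the most common training label (a pure label-prior predictor)."""
    majority = 1 if 2 * sum(train_labels) >= len(train_labels) else 0
    return sum(1 for y in test_labels if y == majority) / len(test_labels)


def permuted_label_control(labels, seed=0):
    """Return a shuffled copy of the labels. Any link to the inputs is
    broken, but the overall label frequencies are unchanged."""
    rng = random.Random(seed)
    shuffled = list(labels)
    rng.shuffle(shuffled)
    return shuffled


# Hypothetical imbalanced binary labels: roughly 60% ones (an assumption).
rng = random.Random(42)
labels = [1 if rng.random() < 0.6 else 0 for _ in range(10_000)]
train, test = labels[:8000], labels[8000:]

# Looks like "signal": accuracy is well above the naive 50% chance level.
real_acc = majority_class_accuracy(train, test)

# Control: same pipeline on permuted labels. If the score does not drop,
# the apparent signal is explained by the label prior alone.
permuted = permuted_label_control(labels, seed=1)
control_acc = majority_class_accuracy(permuted[:8000], permuted[8000:])

print(f"real labels:     {real_acc:.3f}")
print(f"permuted labels: {control_acc:.3f}")
```

In a real experiment the model would see the inputs; the control is informative because a genuine input-dependent signal should fall back to the prior-only baseline once the labels are permuted.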

Dataset: bshepp/round-reduced-sha256-learnability
Code (MIT) + full writeup: https://github.com/bshepp/bfl-asic

satpalsr

posted an update 26 days ago

Post

188

We're open-sourcing our infra along with a dataset of 10M+ frames!

We're releasing Stera, open-source infra that turns an off-the-shelf device in your pocket into a high-fidelity multimodal data pipeline. It's built around four layers: Capture → Process → Evaluate → Export.

Stera Capture removes the need for bespoke/gated hardware and runs on an off-the-shelf iPhone. It fuses synchronized RGB, IMU, LiDAR-guided depth, and 6-DoF pose from ARKit out of the box and exports them to a raw MCAP file.

Dataset: fpvlabs/stera-10m
Launch Details: https://x.com/fpv_labs/status/2055262652033908832

merve

updated a Space 28 days ago

README

🐨

urroxyz

in blog-explorers/README 29 days ago

[Support] Community Articles

🤝🔥 2

106

#5 opened about 2 years ago by

victor

Reality123b

in blog-explorers/README 29 days ago

[Support] Community Articles

🚀🤝 2

106

#5 opened about 2 years ago by

victor

Reality123b

in blog-explorers/README about 1 month ago

Future of Agentic Models

🔥 2

#18 opened about 1 month ago by

MohamedRashad

ZennyKenny

in blog-explorers/README about 1 month ago

🚩 Report: Spam

#19 opened about 1 month ago by

ccocks-deca

MohamedRashad

in blog-explorers/README about 1 month ago

Future of Agentic Models

🔥 2

#18 opened about 1 month ago by

MohamedRashad

ccocks-deca

in blog-explorers/README about 1 month ago

🚩 Report: Spam

#19 opened about 1 month ago by

ccocks-deca

Future of Agentic Models

🔥 2

#18 opened about 1 month ago by

MohamedRashad

AbstractPhil

in blog-explorers/README about 1 month ago

Future of Agentic Models

🔥 2

#18 opened about 1 month ago by

MohamedRashad

apehex

in blog-explorers/README about 1 month ago

Future of Agentic Models

🔥 2

#18 opened about 1 month ago by

MohamedRashad

RiverRider

in blog-explorers/README about 1 month ago

Future of Agentic Models

🔥 2

#18 opened about 1 month ago by

MohamedRashad

Yann-CV

posted an update about 1 month ago

Post

503

🚀 Introducing Goldener: The Python Data Orchestrator for more efficient ML

Machine Learning workflows often rely on randomness: selecting/splitting data for training, batching it variably, and monitoring real-world performance.

Nowadays, foundation models give access to the semantics of data. Goldener leverages these semantics to make the entire ML lifecycle more efficient!

🔗 Check it out: https://github.com/goldener-data/goldener
🔨 Give it a try: pip install goldener

intrect

posted an update about 2 months ago

Post

174

I’m excited to share a new paper I recently posted on arXiv: ArtifactNet: Detecting AI-Generated Music via Forensic Residual Physics.

This work asks a simple question: can AI-generated music be detected not only by style, but by the physical artifacts left behind during generation?

ArtifactNet approaches the problem from that angle. Instead of only learning what AI music sounds like on a fixed benchmark, it analyzes forensic residual patterns linked to neural audio codec bottlenecks such as residual vector quantization (RVQ).
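
For readers unfamiliar with residual vector quantization, here is a minimal generic sketch. It is not ArtifactNet's method or any specific codec's implementation; the toy two-stage codebooks, the input vector, and the function names are assumptions chosen for illustration. Each stage quantizes whatever the previous stage left over, and the final leftover is the kind of quantization residual the post refers to.

```python
def nearest(codebook, vec):
    """Index of the codebook entry closest to vec (squared Euclidean distance)."""
    return min(
        range(len(codebook)),
        key=lambda i: sum((c - v) ** 2 for c, v in zip(codebook[i], vec)),
    )


def rvq_encode(vec, codebooks):
    """Quantize vec stage by stage; each stage encodes the previous residual.
    Returns the chosen index per stage and the final unquantized residual."""
    residual = list(vec)
    indices = []
    for codebook in codebooks:
        i = nearest(codebook, residual)
        indices.append(i)
        residual = [r - c for r, c in zip(residual, codebook[i])]
    return indices, residual


def rvq_decode(indices, codebooks):
    """Reconstruct by summing the selected entry from every stage."""
    out = [0.0] * len(codebooks[0][0])
    for i, codebook in zip(indices, codebooks):
        out = [o + c for o, c in zip(out, codebook[i])]
    return out


# Toy codebooks (assumed): a coarse first stage and a finer second stage.
codebooks = [
    [[0.0, 0.0], [1.0, 1.0], [-1.0, -1.0], [1.0, -1.0]],
    [[0.0, 0.0], [0.25, 0.0], [0.0, 0.25], [-0.25, 0.0], [0.0, -0.25]],
]

vec = [1.2, 0.9]
indices, residual = rvq_encode(vec, codebooks)
decoded = rvq_decode(indices, codebooks)

print(indices)   # one codebook index per stage
print(decoded)   # quantized approximation of vec
print(residual)  # what the quantizer could not represent
```

Adding more stages shrinks the residual but never removes the fact that the output is restricted to sums of codebook entries.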

In our experiments, ArtifactNet achieved F1 = 0.9829 on a zero-overlap multi-generator benchmark spanning 22 AI generators and 6 real-music sources, while using only 4.0M parameters. Under the same evaluation setting, larger prior models showed substantial degradation on out-of-distribution generators and real-music false positives.

I also introduced ArtifactBench, a broader evaluation benchmark designed to stress-test detector robustness across unseen generators, diverse real sources, hard negatives, and codec conditions.

This was a deeply rewarding project at the intersection of audio forensics, MIR, and generative model evaluation.

https://arxiv.org/abs/2604.16254

1 reply

intrect

submitted a paper to Daily Papers about 2 months ago

ArtifactNet: Detecting AI-Generated Music via Forensic Residual Physics

Paper • 2604.16254 • Published Apr 17 • 3

Team members 1,197
