Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up

All HF Hub posts

prithivMLmods 
posted an update 3 days ago
view post
Post
4683
I've made 8 Spaces in the Qwen-Image-Edit series, and out of them, 5 Spaces reached “Space of the Week”! A few Spaces are still topping the list even after many months.

Cumulatively, the series has crossed 8.2 million+ ZeroGPU runs and nearly 4 million visitors overall.

Thanks for all the community support! 🤗❤️

🔗 Spaces: https://huggingface.co/collections/prithivMLmods/image-generation-apps-collection
  • 4 replies
·
PhysiQuanty 
posted an update 3 days ago
AxionLab-official 
posted an update about 11 hours ago
view post
Post
54
Someone ran Supra-50M-Instruct ON A 1GHZ 1999 CPU

https://www.reddit.com/r/LocalLLM/comments/1tm21ar/i_see_your_strix_halo_and_raise_you_a_vintage/

"As a fun experiment, I decided to try running the recently released Supra-50m on a 26-year-old machine I keep for retro Windows 9.X games. Although the model was somewhat silly and inconsistent, the performance wasn't bad, reaching around 1.3 tok/s with CPU inference alone.

Since this CPU doesn't have SSE2, I changed from llama.cpp to llama2.ce and asked Claude to write a custom tokenizer.

It's crazy to think that with the right file size of 200 MB, we could have experienced this magic back in 1999" - u/drone_stonks, r/localllm
pankajpandey-dev 
posted an update 1 day ago
view post
Post
112
Just released Qwen3-0.6B fine-tuned on Hindi instruction data 🇮🇳

✅ Full model: pankajpandey-dev/Qwen3-0.6B-Hindi-Instruct-v1
✅ GGUF versions (Q2/Q4/Q5/Q8): pankajpandey-dev/Qwen3-0.6B-Hindi-Instruct-v1-GGUF

Smallest Hindi-capable GGUF — runs on any laptop at 0.37GB.
Next: v2 with more data, better responses.

#Hindi #LLM #GGUF #OpenSource
Juanxi 
posted an update 1 day ago
view post
Post
108
🧐 BlogXiv provides a scholarly discovery interface for researchers to trace emerging ideas, compare technical perspectives, and engage with high-quality research communication.

Welcome any contributions such as star and submissions, thanks

Github: https://github.com/OpenEnvision/BlogXiv
Website: https://openenvision.github.io/BlogXiv/
kanaria007 
posted an update 1 day ago
view post
Post
99
✅ Article highlight: *Real-Scale World Simulation Game* (art-60-157, v0.1)

TL;DR:
This article asks what it would take to build a “real SAO-like” world without hand-wavy magic.

The answer is not unlimited freedom. It is a *persistent world with bounded agency*: NPCs can act, form societies, trade, govern, and shape history—but only through pinned profiles, CAS state, ledgers, receipts, and replayable world history. In other words: a living world is believable only if it is governable.

Read:
kanaria007/agi-structural-intelligence-protocols

Why it matters:
• shows how to move from “match fairness” to “world-history fairness”
• treats NPC societies as bounded agents rather than decorative scripts
• makes laws, markets, factions, and institutions explicit state layers instead of lore vibes
• explains why “living world” claims need receipts, replay, and anti-abuse monitoring

What’s inside:
• layered world state as CAS: *physics, economy, society, institution, narrative*
• NPCs as receipted bounded agents with observation, action, and resource limits
• institution ledgers for law, market rules, faction control, and world governance
• world replay as *history reproduction*, not just match replay
• adversary monitoring for griefing, market rigging, propaganda, and governance capture
• unique-entity / ownership / transfer receipts for “only one in the world” style claims

Key idea:
Do not say:

*“the world feels alive.”*

Say:

*“this world evolved through a receipted, bounded-agency closed loop: state, NPC decisions, player actions, institutional transitions, replay, monitoring, and publication rules.”*

That is how a persistent world becomes believable without becoming ungovernable.
juiceb0xc0de 
posted an update 2 days ago
view post
Post
128
Gemma-4-E2B SAE Atlas — Work in Progress

JumpReLU Sparse Autoencoders trained on every layer of Gemma-4-E2B-it using an adaptive Lagrangian controller. Training in progress. I'm publishing layers live as they come hot off the press for anyone interested in following along. I will be making further adjustments for finer resolution but the early data should be helpful I think? I'm just a bartender don't trust everything I say. 🤗 The Lagrangian math is pretty cool. It auto-steers the trainer taking the guess work out of hyperparameter adjustments.

Full paper and methodology when ever I get around to writing it up. There's a lot of work to be done. For now though, enjoy! 🤗

juiceb0xc0de/gemma-4-e2b-saes
  • 3 replies
·
AxionLab-official 
posted an update 4 days ago
view post
Post
157
We RELEASED!

SupraLabs just released our 50M model!
Base, Instruct Weights are there, you can use!

You can check blog to more informations!(Writing blog yet!)
Shrijanagain 
posted an update 4 days ago
view post
Post
2489
We are pleased to announce that the W-IMG Vision Dataset infrastructure is officially live.

The complete asset infrastructure is now accessible on Hugging Face for internal validation and architecture scaling targets.

Dataset Endpoint - sKT-Ai-Labs/W-IMG

#SovereignAI #ComputerVision #MachineLearning #OpenSource
danielhanchen 
posted an update 4 days ago