Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up

All HF Hub posts

Kseniase 
posted an update 1 day ago
view post
Post
4342
10 Latest Preference Optimization Techniques

Models need feedback on what makes outputs “good” or “bad.” Policy optimization (PO) turns preferences and rewards into actual training signals. This field is evolving quickly, moving far beyond classics like PPO and GRPO. So here is our overview of 10 newest PO methods:

1. Pref-GRPO → Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning (2508.20751)
Stabilizes text-to-image reinforcement learning (RL) with pairwise preference rewards and a unified UNIGENBENCH benchmark

2. PVPO (Policy with Value Preference Optimization) → PVPO: Pre-Estimated Value-Based Policy Optimization for Agentic Reasoning (2508.21104)
This critic-free RL method uses a pre-trained model as a reference anchor to reduce bias and guide learning, selecting high-value examples through data pre-sampling

3. DCPO (Dynamic Clipping Policy Optimization) → DCPO: Dynamic Clipping Policy Optimization (2509.02333)
Uses dynamic clipping, which adjusts probability limits per token for better token exploration, and smooth reward standardization to balance rewards over training steps and prevent wasted updates

4. ARPO (Agentic Reinforced Policy Optimization) → Agentic Reinforced Policy Optimization (2507.19849)
Optimizes multi-turn LLM agents that use external tools. It uses an entropy-based adaptive rollout to explore post-tool use and an advantage attribution method to better assign credit across steps, leading to more efficient tool use with fewer resources

5. GRPO-RoC (Group Relative Policy Optimization with Resampling-on-Correct) → rStar2-Agent: Agentic Reasoning Technical Report (2508.20722)
Oversamples rollouts, then resamples them to keep diverse mistakes and only the highest-quality correct answers. It reduces noises and ends up with stronger reasoning in a code environment

Read further below ⬇️
If you like this, also subscribe to the Turing post: https://www.turingpost.com/subscribe
  • 1 reply
·
prithivMLmods 
posted an update 2 days ago
view post
Post
5514
Dropped the HeadshotX : a super-realistic headshot adapter for Qwen/Qwen-Image, an image generation model by Qwen. It is an advanced LoRA adaptation of the Qwen-Image model and an upgraded version of prithivMLmods/Qwen-Image-Studio-Realism, offering more precise portrait rendering with a strong focus on realism. The model was trained on diverse face types from across the world, labeled with florence2-en and caption-optimized using prithivMLmods/DeepCaption-VLA-7B. 11(types) × 5 different face types: Asian, Hispanic, Caucasian, Latina, Middle Eastern, etc.

⮞ Model🤗: prithivMLmods/Qwen-Image-HeadshotX

⮞ The Previous Adapter (LoRA): prithivMLmods/Qwen-Image-Studio-Realism

⮞ Collection: prithivMLmods/qwen-image-exp-lora-68a978fe11400bc3165b0c4d

.
.
.
To know more about it, visit the app page or the respective model page!!
  • 2 replies
·
MonsterMMORPG 
posted an update 2 days ago
view post
Post
5045
SUPIR is Still Unchallanged Image Upscaler — Supports GPUs starting from RTX 1000 series to RTX 5000 series

App Download Link
You can download SUPIR app from here : https://www.patreon.com/posts/99176057

CHECK BELOW SCREENSHOTS

It has 1-click installers for Windows (only Python 3.10.11 and Git should be sufficient), RunPod (official Pytorch 2.2.0 template) and Massed Compute template Creator > SECourses

App Info
SUPIR: Scaling Up to Excellence: Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild 1 click installer scripts.

SUPIR Sampler and Text CFG Comparison : https://imgsli.com/MjU2ODQz/2/1

Gemini 2.5 Pro prompt to get image description for free :

describe this image for sdxl. write single line prompt to regenerate it exactly same. make the prompt extremely detailed

https://aistudio.google.com/prompts/new_chat

Use Default preset for highest loyalty and Replicate preset for adding more details

Human upscale from 1024x1024 to 3072x3072 (3x upscale and total 9x resolution) with face restore comparison

https://imgsli.com/NDEzMDYx

Owl upscale from 1024x1024 to 3072x3072 (3x upscale and total 9x resolution)

https://imgsli.com/NDEzMDYy

Video Tutorials
Tutorials are older but hopefully a newer one will be made and they should be still useful

Complete Guide to SUPIR Enhancing and Upscaling Images Like in Sci-Fi Movies on Your PC

How To Install SUPIR On RunPod and Massed Compute

How To Install & Use SUPIR : SOTA Image Upscaler On RunPod — 1 Click Easy Install & Run

6 September 2025 Update V91
Libraries upgraded to Torch 2.8, CUDA 12.9, xFormers 0.0.33, Flash Attention 2.8.3

You don’t need to have CUDA or anything else installed and it should work with Python 3.10.11 and Git installed

When compiling libraries, I added support for all GPUs starting from RTX 1000 to 5000 series including other GPUs like A100, H100, B200, L40, etc

Compiled for TORCH_CUDA_ARCH_LIST=6.1;7.5;8.0;8.6;8.9;9.0;10.0;12.0

DualityAI-RebekahBogdanoff 
posted an update 3 days ago
view post
Post
3504
Shout out to the winners of the "Synthetic2Real Object Detection Challenge" Duality AI hosted earlier this year. Out of the 1000+ participants in our challenges, these users stood out above the rest.

🥇 1st place: Kaggle user "richardtroy"

🥈 2nd place: @sergio-sanz-rodriguez

🥉 3rd place: @Nadiaaaaaaa

View the entire leaderboard at - https://tinyurl.com/38ebvcwf

Join our current Grocery Items: Multi-Class Object Detection Synthetic2Real Kaggle competition here: https://tinyurl.com/y224rttu

And be on the lookout for anther competition in the next couple weeks with a brand new domain!
hint: ✈️
hesamation 
posted an update 4 days ago
view post
Post
4189
a senior engineer at google just dropped a 400-page free book on docs for review: agentic design patterns.

the table of contents looks like everything you need to know about agents + code:
> advanced prompt techniques
> multi-agent patterns
> tool use and MCP
> you name it

read it here: https://docs.google.com/document/d/1rsaK53T3Lg5KoGwvf8ukOUvbELRtH-V0LnOIFDxBryE/edit?tab=t.0#heading=h.pxcur8v2qagu

you can also pre-order on Amazon (published by Springer) and the royalties goes to Save the Children: https://www.amazon.com/Agentic-Design-Patterns-Hands-Intelligent/dp/3032014018/
salma-remyx 
posted an update about 18 hours ago
view post
Post
2687
The docs for GitRank are live! Follow along to see how you can:

📖 Daily personalized papers from arXiv matching your project context
👩‍💻 One-click PRs with complete implementation, tests, and docs
🚀 Parallel experimentation - test multiple ideas with ease

Your next great idea is probably in a paper you haven't had time to implement.

Try it today! http://docs.remyx.ai/resources/ideate
prithivMLmods 
posted an update 3 days ago
view post
Post
3224
Comparing: DeepCaption-VLA-7B, built on Qwen2.5-VL-7B-Instruct, is tailored for image captioning and vision-language attribution, focusing on precise, descriptive captions of visual properties, object attributes, and scene details. In contrast, Qwen2.5-VL-7B-Abliterated-Caption-it is fine-tuned for abliterated captioning, generating highly detailed descriptions across diverse visual categories.

Models🤗
✦ DeepCaption-VLA-7B : prithivMLmods/DeepCaption-VLA-7B
✦ Qwen2.5-VL-7B-Abliterated-Caption-it : prithivMLmods/Qwen2.5-VL-7B-Abliterated-Caption-it

Spaces⛵
➜ VisionScope-R2 : prithivMLmods/VisionScope-R2
➜ Qwen2.5-VL-Outpost : prithivMLmods/Qwen2.5-VL-Outpost

Collection🗞️
DeepCaption attr. : prithivMLmods/deepcaption-attr-68b041172ebcb867e45c556a
VL Abliterated-Caption : prithivMLmods/vl-abliterated-caption-68a0443b63182e97a15c47a3
Multimodal VLMs - Until July'25 : prithivMLmods/multimodal-vlms-until-july25-688312e6b840e1e156f13027
Multimodal VLMs - Aug'25 : prithivMLmods/multimodal-vlms-until-july25-688312e6b840e1e156f13027

GitHub↗️
> DeepCaption-VLA-7B [4bit-notebook demo] : https://github.com/PRITHIVSAKTHIUR/Multimodal-Outpost-Notebooks/blob/main/DeepCaption-VLA-7B%5B4bit%20-%20notebook%20demo%5D/DeepCaption-VLA-7B.ipynb
> Qwen2.5-VL-3B-Abliterated-Caption-it(caption) : https://github.com/PRITHIVSAKTHIUR/Multimodal-Outpost-Notebooks/blob/main/Qwen2.5-VL-3B-Abliterated-Caption-it(caption)/Qwen2_5_VL_3B_Abliterated_Caption_it.ipynb

The community GPU grant was given by Hugging Face — special thanks to them. 🤗🚀

To know more about it, visit the app page or the respective model page!!
burtenshaw 
posted an update 3 days ago
view post
Post
2521
The open source AI community is just made of people who are passionate and care about their work. So we thought it would be cool to share our favourite icons of the community with a fun award.

Winners get free Hugging Face Pro Subscriptions, Merchandise, or compute credits for the hub.

🔗 Follow and nominate here: community-spotlight

This is a new initiative to recognise and celebrate the incredible work being done by community members. It's all about inspiring more collaboration and innovation in the world of machine learning and AI.

They're highlighting contributors in four key areas:
- model creators: building and sharing innovative and state-of-the-art models.
- educators: sharing knowledge through posts, articles, demos, and events.
- tool builders: creating the libraries, frameworks, and applications that we all use.
- community champions: supporting and mentoring others in forums.

Know someone who deserves recognition? Nominate them by opening a post in the Hugging Face community forum.
  • 1 reply
·
burtenshaw 
posted an update about 7 hours ago
view post
Post
119
new smol course

If you’re building with or learning about post training AI models right now, we have a new FREE and CERTIFIED course.

🔗 Follow the org to join in smol-course

The course builds on smol course v1 which was the fastest way to learn to train your custom AI models. It now has:

- A leaderboard for students to submit models to
- Certification based on exams and leaderboards
- Prizes based on Leaderboards
- Up to date content on TRL and SmolLM3
- Deep integration with the Hub’s compute for model training and evaluation

We will release chapters every few weeks, so you can follow the org to stay updated.
Reubencf 
posted an update about 9 hours ago
view post
Post
70
Introducing the Nano Banana Node Editor! 🍌

Now you can control and manipulate Nano Banana images with a powerful, intuitive node-based system. Explore the creative possibilities at: Reubencf/Nano_Banana_Editor

This version is clearer, more inviting, and emphasizes the creative potential of your tool.