Jinyuan Li

jinyuan222

AI & ML interests

None yet

Recent Activity

upvoted a paper about 14 hours ago

Small RL Controller, Large Language Model: RL-Guided Adaptive Sampling for Test-Time Scaling

Paper • 2606.03102 • Published 1 day ago • 10

upvoted a paper 14 days ago

You Only Need Minimal RLVR Training: Extrapolating LLMs via Rank-1 Trajectories

Paper • 2605.21468 • Published 15 days ago • 50

upvoted a paper 15 days ago

Process Rewards with Learned Reliability

Paper • 2605.15529 • Published 20 days ago • 53

submitted a paper to Daily Papers 15 days ago

Process Rewards with Learned Reliability

Paper • 2605.15529 • Published 20 days ago • 53

upvoted a paper 15 days ago

AI for Auto-Research: Roadmap & User Guide

Paper • 2605.18661 • Published 17 days ago • 67

upvoted a paper 24 days ago

LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling

Paper • 2605.08083 • Published 27 days ago • 69

upvoted a paper 27 days ago

Nonsense Helps: Prompt Space Perturbation Broadens Reasoning Exploration

Paper • 2605.05566 • Published 28 days ago • 37

updated a model about 2 months ago

jinyuan222/visualprm400K-all-soft-vanillaprm-internvl3-8b

8B • Updated Apr 13 • 4

published a model about 2 months ago

jinyuan222/visualprm400K-all-soft-vanillaprm-internvl3-8b

8B • Updated Apr 13 • 4

updated a model about 2 months ago

jinyuan222/visualprm400K-all-soft-vanillaprm-internvl3-14b

15B • Updated Apr 12 • 1

published a model about 2 months ago

jinyuan222/visualprm400K-all-soft-vanillaprm-internvl3-14b

15B • Updated Apr 12 • 1

updated a model about 2 months ago

jinyuan222/visualprm400K-beta-binom-internvl3-14b

15B • Updated Apr 9 • 3

published a model about 2 months ago

jinyuan222/visualprm400K-beta-binom-internvl3-14b

15B • Updated Apr 9 • 3

updated a model about 2 months ago

jinyuan222/visualprm400K-beta-binom-internvl3-8b

8B • Updated Apr 7 • 3

published a model about 2 months ago

jinyuan222/visualprm400K-beta-binom-internvl3-8b

8B • Updated Apr 7 • 3

updated a model about 2 months ago

jinyuan222/visualprm400K-random8

8B • Updated Apr 6 • 5

published a model about 2 months ago

jinyuan222/visualprm400K-random8

8B • Updated Apr 6 • 5

updated a model about 2 months ago

jinyuan222/visualprm400K-lowmc8

8B • Updated Apr 6 • 4

published a model about 2 months ago

jinyuan222/visualprm400K-lowmc8

8B • Updated Apr 6 • 4

updated a model about 2 months ago

jinyuan222/visualprm400K-mix8

8B • Updated Apr 6 • 4