Skip to content
View michaelhunley's full-sized avatar

Block or report michaelhunley

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Popular repositories Loading

  1. autoreward autoreward Public

    Turn subjective quality judgments at each step into empirical signals that drive auto-validation and RLAIF pipelines. Tiers (Measured > Predicted > Reviewed) + gauge library + model index, injectab…

    Python 1