Nous Research presents Minos, a lightweight classifier designed to detect refusals in text. Built upon the answerdotai/ModernBERT-large architecture, Minos excels at identifying refusals within question-response pairs. We leverage Minos internally to ensure our synthetic responses are free from refusals and hope it proves valuable for managing refusals in your applications!
Will give the refusal/non-refusal classification after every message
Only tested on my personal environment but im just using a venv rather than a docker env, i believe it should work right out of the box but you may need to either restart the server or do a quick test script referenced in the HF nous/minos page to download the model and confirm it works in your env.