Bielik Guard: Efficient Polish Language Safety Classifiers for LLM Content Moderation
Krzysztof Wróbel, Jan Maria Kowalski, Jerzy Surma +2 more
As Large Language Models (LLMs) are increasingly deployed in Polish-language applications, the need for efficient and accurate content safety classifiers has become paramount. We present Bielik Guard, a family of compact Polish-language safety classifiers comprising two model variants: a 0.1B par...