German Commons, a newly released dataset, is now the largest openly licensed collection of German-language text. By using permissive licenses, it ensures that developers and researchers can train German language models without navigating complex copyright issues. This dataset encompasses diverse sources—from literature and news to social media—enabling comprehensive language understanding. With German Commons, AI practitioners can build and deploy German LLMs (large language models) legally and ethically, fostering innovation in German NLP. The open licensing also invites community contributions and continuous improvements, ensuring long-term sustainability and relevance. As demand for German AI applications grows, German Commons sets a new standard for transparent and compliant dataset sharing.
はじめて仮想通貨を買うなら Coincheck !
- ✅ アプリDL 国内 No.1
- ✅ 500円 から 35 銘柄を購入
- ✅ 取引開始まで 最短1日
口座開設は完全無料。思い立った今がはじめどき!
👉 登録手順を画像つきで確認する












