A generative speech model for daily dialogue.
-
Updated
Nov 5, 2024 - Python
A generative speech model for daily dialogue.
📙 中华新华字典数据库。包括歇后语,成语,词语,汉字。
MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。
A linting tool for Chinese language.
Chinese safety prompts for evaluating and improving the safety of LLMs. 中文安全prompts,用于评估和提升大模型的安全性。
Rime Cantonese input schema | 粵語拼音輸入方案
A framework for cleaning Chinese dialog data
收集非普通話漢語和古漢語的中州韻輸入法拼音方案 Collection of phonetic spelling schemas for Sinitic languages and dialects
Discovering magic squares in Tang Dynasty poems
Learn, read, write and practice Mandarin by drawing strokes in Anki Desktop, AnkiDroid and AnkiMobile with audio of HSK 2.0 (HSK1-6) and HSK 3.0 (HSK 1-9) characters.
Python scraper for Language Pods such as Japanesepod101.com 👹 🗾 🍣 Compatible with Japanese, Chinese, French, German, Italian, Korean, Portuguese, Russian, Spanish and many more! ✨
中文词典 / 中文詞典。Chinese / Chinese-English dictionaries.
Python toolkit for Chinese Language Understanding(CLUE) Evaluation benchmark
CJK computer science terms comparison / 中日韓電腦科學術語對照 / 日中韓のコンピュータ科学の用語対照 / 한·중·일 전산학 용어 대조
solidity-by-example 教程中文翻译|@Web3-Club
Free Human Language Learning Resources
Từ điển tiếng Việt dành cho máy đọc sách Kindle, Kobo, Pocketbook v.v.
文本去重
简繁转换 簡繁轉換 Python implementation of StarCC, the next generation of Simplified-Traditional Chinese conversion framework
Add a description, image, and links to the chinese-language topic page so that developers can more easily learn about it.
To associate your repository with the chinese-language topic, visit your repo's landing page and select "manage topics."