Investigations - LLM / Large Language Model
Topics related to https://en.wikipedia.org/wiki/Large_language_model
-
-
All-in-One Engine+UI
-
Local Engine Server
Web UI
Software Development
-
ChatDev - https://github.com/OpenBMB/ChatDev
-
AutoPR - https://github.com/irgolic/AutoPR
Supporting Tools/Libraries
-
LiteLLM - https://github.com/BerriAI/litellm/
-
Ellama (LLM from Emacs) - https://github.com/s-kostyaev/ellama
Models
Models with Japanese Support
-
…
-
CyberAgentLM3
References
-
Dolly 2.0 - https://twitter.com/kun1em0n/status/1646356918049599488
-
Vicuna-based
-
Llama 2
-
ELYZA-japanese-Llama
-
LLM helpers
-
https://twitter.com/kis/status/1686333070742568960?t=IBMI7ICMQmxTp8fDTaeMRA&s=19
-
Japanese models
-
https://twitter.com/npaka123/status/1753336604759118014?t=ySDSi8UYHf8HXIpKm0TbHg&s=19
-
HyperCLOVA (LINE) Japanese LLM
Japanese LLMs: Summary of the main freely available Japanese large language models (LLMs) - https://zenn.dev/hellorusk/articles/ddee520a5e4318
-
Rinna
-
https://twitter.com/sudy_super/status/1680148471821529090?t=Vd_2mtQggcpBZsMjlKaIzA&s=19
-
Helper Implementations
-
Branching Approaches of LLM-based Technologies
-
Transformer
-
GPT
-
BLOOM
-
Megatron
-
DeepSpeed
-
-
Models
-
Mistral AI + Microsoft