Investigations - LLM / Large Language Model
Topics related to https://en.wikipedia.org/wiki/Large_language_model
-
-
All-in-One Engine+UI
-
Local Engine Server
Web UI
-
https://github.com/open-webui/open-webui
-
Tips
-
-
Software Development
-
ChatDev - https://github.com/OpenBMB/ChatDev
-
AutoPR - https://github.com/irgolic/AutoPR
Supporting Tool/Lib
-
LiteLLM - https://github.com/BerriAI/litellm/
-
Ellama (LLM from Emacs) - https://github.com/s-kostyaev/ellama
Models
Models with Japanese Support
-
…
-
CyberAgentLM3
Prompt
-
https://gist.github.com/philschmid/3a0ecc9e45763716f4dd9c36b6445fca#file-openai_meta-txt
Tool Calling / Function Calling
References
-
Dolly 2.0 - https://twitter.com/kun1em0n/status/1646356918049599488
-
Vicuna-based
-
Llama 2
-
ELYZA-japanese-Llama
-
LLM helpers
-
https://twitter.com/kis/status/1686333070742568960?t=IBMI7ICMQmxTp8fDTaeMRA&s=19
-
Japanese models
-
https://twitter.com/npaka123/status/1753336604759118014?t=ySDSi8UYHf8HXIpKm0TbHg&s=19
-
HyperCLOVA / LINE Japanese LLM -
Japanese LLMs: Summary of the main freely available Japanese large language models (LLMs) - https://zenn.dev/hellorusk/articles/ddee520a5e4318
-
Rinna
-
https://twitter.com/sudy_super/status/1680148471821529090?t=Vd_2mtQggcpBZsMjlKaIzA&s=19
-
Helper Implementations
-
Branching Approaches of LLM-based Technologies
-
Transformer
-
GPT
-
BLOOM
-
Megatron
-
DeepSpeed
-
-
Models
-
Mistral AI + Microsoft
-
Ollama