使用Decoder-only的Transformer进行时序预测,包含SwiGLU和RoPE(Rotary Positional Embedding),Time series prediction using Decoder-only Transformer, Including SwiGLU and RoPE(Rotary Positional Embedding)
-
Updated
Jan 25, 2024 - Python
使用Decoder-only的Transformer进行时序预测,包含SwiGLU和RoPE(Rotary Positional Embedding),Time series prediction using Decoder-only Transformer, Including SwiGLU and RoPE(Rotary Positional Embedding)
A single-file implementation of LLaMA 3, with support for jitting, KV caching and prompting
Add a description, image, and links to the rotary-positional-embedding topic page so that developers can more easily learn about it.
To associate your repository with the rotary-positional-embedding topic, visit your repo's landing page and select "manage topics."