
Understanding Encoder And Decoder LLMs - Sebastian Raschka, …
Jun 17, 2023 · Delve into Transformer architectures: from the original encoder-decoder structure, to BERT & RoBERTa encoder-only models, to the GPT series focused on decoding. Explore their evolution, strengths, & applications in NLP tasks.
Why are most LLMs decoder-only? - Medium
Feb 3, 2024 · In a translation task, the encoder takes an English sentence and converts it into a vector that represents its linguistic features and meaning. The decoder then takes that encoded representation and...
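To make that encoder-to-decoder flow concrete, here is a minimal translation sketch. It assumes the Hugging Face transformers library and the public t5-small checkpoint, neither of which the snippet above specifies.

```python
# Minimal encoder-decoder translation sketch (assumes the Hugging Face
# `transformers` library and the public `t5-small` checkpoint).
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

tokenizer = AutoTokenizer.from_pretrained("t5-small")
model = AutoModelForSeq2SeqLM.from_pretrained("t5-small")

# The encoder reads the English sentence and produces hidden states;
# the decoder attends to them while generating the translation.
inputs = tokenizer("translate English to German: The weather is nice.",
                   return_tensors="pt")
output_ids = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```

Note the division of labor: the encoder runs once over the source sentence, while generate drives the decoder token by token against that fixed encoding.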
The Three LLM Architectures: Encoder-only, Decoder-only, Encoder-Decoder
Among LLMs, some are encoder-only, some are decoder-only, and some are encoder-decoder hybrids. All three are forms of Seq2Seq (sequence-to-sequence) modeling. And although an encoder-only model literally contains just an encoder, in practice such LLMs can still decode some text and tokens, so in that sense they act as decoders too.
Decoder-Based Large Language Models: A Complete Guide
Apr 27, 2024 · Decoder-based LLMs can be broadly classified into three main types: encoder-decoder, causal decoder, and prefix decoder. Each architecture type exhibits a distinct attention pattern.
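A small mask sketch makes those attention patterns concrete; the sequence length and prefix split below are arbitrary illustration values, not anything taken from the guide.

```python
# Sketch of the attention masks that distinguish the three decoder-based
# variants (illustrative only; True means "may attend").
import torch

seq_len, prefix_len = 6, 3

# Causal decoder: each position attends only to itself and earlier positions.
causal = torch.tril(torch.ones(seq_len, seq_len, dtype=torch.bool))

# Prefix decoder: the first `prefix_len` tokens attend bidirectionally to
# each other; the remaining tokens stay causal.
prefix = causal.clone()
prefix[:prefix_len, :prefix_len] = True

# Encoder-decoder: the encoder side is fully bidirectional, while the
# decoder side keeps the causal mask plus cross-attention to the encoder.
encoder_side = torch.ones(seq_len, seq_len, dtype=torch.bool)

print(causal.int(), prefix.int(), encoder_side.int(), sep="\n\n")
```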
[2304.04052] Decoder-Only or Encoder-Decoder? Interpreting …
Apr 8, 2023 · This paper aims to address this gap by conducting a detailed comparison between the encoder-decoder architecture and the decoder-only language model framework through the analysis of a regularized encoder-decoder structure.
Why do some LLMs have both an Encoder and a Decoder and …
May 4, 2024 · Some others, like T5, have both an encoder and a decoder, with small modifications to the architecture and training strategy. Why did some LLMs take only a part of the original Transformer...
LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders
Apr 9, 2024 · In this work, we introduce LLM2Vec, a simple unsupervised approach that can transform any decoder-only LLM into a strong text encoder. LLM2Vec consists of three simple steps: 1) enabling bidirectional attention, 2) masked next token prediction, and 3) unsupervised contrastive learning.
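As a rough illustration of the end result only, the sketch below mean-pools a decoder-only model's hidden states into a single text embedding. It skips steps 1 and 2 of LLM2Vec (patching the attention mask to be bidirectional and training with masked next token prediction), and gpt2 is merely a placeholder checkpoint.

```python
# Conceptual sketch: extract a text embedding from a decoder-only model by
# mean-pooling its hidden states. The bidirectional-attention patch and MNTP
# training that LLM2Vec prescribes are omitted; `gpt2` is an assumption.
import torch
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModel.from_pretrained("gpt2")

inputs = tokenizer("LLMs can double as text encoders.", return_tensors="pt")
with torch.no_grad():
    hidden = model(**inputs).last_hidden_state      # (1, seq_len, dim)

mask = inputs["attention_mask"].unsqueeze(-1)       # (1, seq_len, 1)
embedding = (hidden * mask).sum(1) / mask.sum(1)    # mean over real tokens
print(embedding.shape)                              # torch.Size([1, 768])
```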
LLM Inference — A Detailed Breakdown of Transformer ... - Medium
Sep 29, 2024 · In this blog, we’ll explore the architecture of Transformer-based Large Language Models (LLMs), focusing on the encoder-decoder structure. Although this analysis centers on the encoder-decoder...
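The decode phase of inference that such breakdowns analyze boils down to an autoregressive loop like the one sketched here; gpt2 is an arbitrary example checkpoint, and real serving stacks add a KV cache, batching, and sampling on top.

```python
# Minimal greedy autoregressive decoding loop, the core of LLM inference.
import torch
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

ids = tokenizer("The Transformer architecture", return_tensors="pt").input_ids
for _ in range(10):                    # generate 10 tokens, one at a time
    with torch.no_grad():
        logits = model(ids).logits     # (1, seq_len, vocab)
    next_id = logits[0, -1].argmax()   # greedy: pick the most likely token
    ids = torch.cat([ids, next_id.view(1, 1)], dim=1)

print(tokenizer.decode(ids[0]))
```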
Encoder-Only vs Decoder-Only Style LLM Architectures: …
Sep 22, 2024 · Encoder-Only vs Decoder-Only: Uncovering the Distinctions. Let’s highlight the key differences between encoder-only and decoder-only architectures: Use Cases: Encoder-only models are well-suited for predictive modeling tasks, leveraging embeddings for classification.
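A minimal sketch of that classification pattern, assuming a BERT checkpoint from Hugging Face: the encoder produces a [CLS] embedding and a small linear head maps it to class logits. The head here is untrained, so the printed logits are placeholders, not real predictions.

```python
# Encoder-only pattern: BERT's [CLS] embedding feeds a classification head.
import torch
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
encoder = AutoModel.from_pretrained("bert-base-uncased")
head = torch.nn.Linear(encoder.config.hidden_size, 2)    # e.g. pos/neg

inputs = tokenizer("A great movie!", return_tensors="pt")
with torch.no_grad():
    cls_vec = encoder(**inputs).last_hidden_state[:, 0]  # [CLS] embedding
print(head(cls_vec))                                     # raw class logits
```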
Understanding LLMs: A Comprehensive Overview from Training to …
The encoder-decoder architecture of LLMs is built upon the traditional Transformer encoder-decoder design, and it consists of two main components: the Encoder and the Decoder.
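PyTorch's nn.Transformer mirrors exactly this two-component layout; the dimensions below are toy values chosen only for the sketch.

```python
# nn.Transformer bundles a stack of encoder layers and a stack of decoder
# layers, matching the two-component architecture described above.
import torch
import torch.nn as nn

model = nn.Transformer(d_model=64, nhead=4,
                       num_encoder_layers=2, num_decoder_layers=2,
                       batch_first=True)

src = torch.randn(1, 10, 64)   # encoder input: source sequence
tgt = torch.randn(1, 7, 64)    # decoder input: target sequence so far
out = model(src, tgt)          # decoder output, conditioned on the encoder
print(out.shape)               # torch.Size([1, 7, 64])
```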