"GPUメモリに限りがある状況（16GB T4や24GB RTX3090など）でも大規模な言語モデルを高パフォーマンスで実行できる「FlexGen」"

kns_1234 のブックマーク 2023/02/21 11:52

<blockquote class="hatena-bookmark-comment"><a class="comment-info" href="https://b.hatena.ne.jp/entry/4732659427161609476/comment/kns_1234" data-user-id="kns_1234" data-entry-url="https://b.hatena.ne.jp/entry/s/github.com/FMInference/FlexGen" data-original-href="https://github.com/FMInference/FlexGen" data-entry-favicon="https://cdn-ak2.favicon.st-hatena.com/64?url=https%3A%2F%2Fgithub.com%2FFMInference%2FFlexGen" data-user-icon="/users/kns_1234/profile.png">GitHub - FMInference/FlexGen: Running large language models on a single GPU for throughput-oriented scenarios.</a><ul class="comment-tag" style="list-style: none; margin: 0px;"><li style="float: left">[<a href="https://b.hatena.ne.jp/q/%E4%BA%BA%E5%B7%A5%E7%9F%A5%E8%83%BD">人工知能</a>]</li><li style="float: left">[<a href="https://b.hatena.ne.jp/q/%E6%8A%80%E8%A1%93">技術</a>]</li></ul><br><p style="clear: left">&quot;GPUメモリに限りがある状況（16GB T4や24GB RTX3090など）でも大規模な言語モデルを高パフォーマンスで実行できる「FlexGen」&quot;</p><a class="datetime" href="https://b.hatena.ne.jp/kns_1234/20230221#bookmark-4732659427161609476"><span class="datetime-body">2023/02/21 11:52</span></a></blockquote><script src="https://b.st-hatena.com/js/comment-widget.js" charset="utf-8" async></script>

このブックマークにはスターがありません。
最初のスターをつけてみよう！

GitHub - FMInference/FlexGen: Running large language models on a single GPU for throughput-oriented scenarios.

github.com/FMInference2023/02/21

In recent years, large language models (LLMs) have shown great performance across a wide range of tasks. Increasingly, LLMs have been applied not only to interactive applications (such as chat), bu...

38 人がブックマーク・3 件のコメント

他のコメントを読む

＼コメントがサクサク読めるアプリです／

はてなブックマーク

GitHub - FMInference/FlexGen: Running large language models on a single GPU for throughput-oriented scenarios.

はてなブックマーク

公式Twitter

はてなのサービス