[B! language-model][arxiv] nabinnoのブックマーク

nabinno id:nabinno

language-modelとarxivに関するnabinnoのブックマーク (11)

A Comprehensive Overview of Large Language Models (LLMs) for Cyber Defences: Opportunities and Directions
- 1 user
- arxiv.org
- 学び
nabinno 2024/05/25
arxiv

language-model

security-engineering
リンク
RAG評価ツール「RAGAS」の論文を読む
本記事では、RAG評価ツールの「RAGAS」の論文ついてざっくり理解します。軽めの記事です。株式会社ナレッジセンスでは、生成AIやRAGシステムを活用したサービスを開発しており、その中でもRAG精度の評価は非常に重要です。この記事は何この記事は、「RAGAS」についての論文[1]を、日本語で簡単にまとめたものです。RAGASはツールとして有名ではあるものの、RAGASの論文を読んだことがある方は多くなさそうです。 RAGASとは、おそらく今、一番有名なRAG評価ツールです。同様のツールとして、他にはLangSmithやARESが有名かと思いますが、他の開発者と話をしていても、評価ツールとして一番に名前が上がりやすいのは、やはりRAGASです。また、今回も「そもそもRAGとは？」については、知っている前提で進みます。確認する場合は以下の記事もご参考下さい。本題ざっくりサマリー R
nabinno 2024/05/17
zenn

knowledge-sense

atsushi-kadowaki

arxiv

retrieval-augumented-generation

language-model
リンク
Large Language Models in the Workplace: A Case Study on Prompt Engineering for Job Type Classification
View a PDF of the paper titled Large Language Models in the Workplace: A Case Study on Prompt Engineering for Job Type Classification, by Benjamin Clavi\'e and Alexandru Ciceu and Frederick Naylor and Guillaume Souli\'e and Thomas Brightwell View PDF Abstract:This case study investigates the task of job classification in a real-world setting, where the goal is to determine whether an English-langu
nabinno 2023/10/22
arxiv

thesis

prompt-engineering

gpt

language-model
リンク
GitHub - rkmt/summarize_arxv
You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session. Dismiss alert
nabinno 2023/06/25
github

summarize_arxiv

arxiv

thesis

gpt

language-model

python
リンク
https://arxiv.org/pdf/2305.09617.pdf
- 2 users
- arxiv.org
- 学び
nabinno 2023/05/17
arxiv

google-research

deepmind

med-palm

language-model

uslme

healthcare-industry
リンク
GPT-4が労働市場に与える影響と各職種のリスク評価──OpenAIの研究者が論文発表
大規模言語モデル（LLM）の「GPT-4」を手掛けた米OpenAIとペンシルベニア大学の研究者らは3月17日、「GPTはGPT：LLMの労働市場への影響の可能性に関する初期の考察」と題した論文を公開した。「GPT（Generative Pre-trained Transf ormer）モデルと関連技術が米国の労働市場に与える潜在的な影響を調査」したとしている。調査の結果、米国の労働力の約80％が、GPTの導入によって少なくとも仕事の10％に影響を受ける可能性があり、約19％の労働者は仕事の50％に影響を受ける可能性があることが示されたという。ほぼすべての職種に影響するが、特に現在高収入な職種のリスクが高いとしている。調査は、学習達成度を表を用いて測定するルーブリック評価によって、職業別の人間の専門知識とGPT-4を使った場合を比較した。方法は、1016の職業について、職業ごとに測定す
nabinno 2023/03/22
itmedia

arxiv

openai

tyna-eloundou

sam-manning

pamela-mishkin

daniel-rock

gpt-4

language-model

labor-economics
リンク
Toolformer: Language Models Can Teach Themselves to Use Tools
Language models (LMs) exhibit remarkable abilities to solve new tasks from just a few examples or textual instructions, especially at scale. They also, paradoxically, struggle with basic functionality, such as arithmetic or factual lookup, where much simpler and smaller models excel. In this paper, we show that LMs can teach themselves to use external tools via simple APIs and achieve the best of
nabinno 2023/02/13
arxiv

toolformer

language-model

gpt-3

transformer

search-engine
リンク
How Close is ChatGPT to Human Experts? Comparison Corpus, Evaluation, and Detection
The introduction of ChatGPT has garnered widespread attention in both academic and industrial communities. ChatGPT is able to respond effectively to a wide range of human questions, providing fluent and comprehensive answers that significantly surpass previous public chatbots in terms of security and usefulness. On one hand, people are curious about how ChatGPT is able to achieve such strength and
nabinno 2023/02/05
arxiv

chatgpt

gpt-3

language-model
リンク
ChatGPTと人間の違いは？　約4万件の質疑応答から中国の研究者らが分析
Innovative Tech：このコーナーでは、テクノロジーの最新研究を紹介するWebメディア「Seamless」を主宰する山下裕毅氏が執筆。新規性の高い科学論文を山下氏がピックアップし、解説する。Twitter: ＠shiropen2 中国の上海財経大学などに所属する研究者らが発表した論文「How Close is ChatGPT to Human Experts？ Comparison Corpus, Evaluation, and Detection」は、ChatGPTと人間の専門家が答える内容のそれぞれの特徴や違いを調査した研究報告である。 2022年11月のリリース以来、OpenAIのChatGPTは自然言語処理（NLP）コミュニティー含め、学術界や産業界など多くの分野で注目を集めている。有用性の面でこれまでの公共チャットbotを大幅にしのいでおり、さまざまなところで活用され
nabinno 2023/02/05
itmedia

hiroki-yamashita

arxiv

chatgpt

gpt-3

language-model

emotion
リンク
[PDF] Large Language Models Encode Clinical Knowledge
nabinno 2023/01/25
arxiv

google-research

deepmind

med-palm

language-model

uslme

healthcare-industry
リンク
Large Language Models Encode Clinical Knowledge
Large language models (LLMs) have demonstrated impressive capabilities in natural language understanding and generation, but the quality bar for medical and clinical applications is high. Today, attempts to assess models' clinical knowledge typically rely on automated evaluations on limited benchmarks. There is no standard to evaluate model predictions and reasoning across a breadth of tasks. To a
nabinno 2023/01/25
arxiv

deepmind

med-palm

language-model

usmle

healthcare-industry
リンク
1

お知らせ

もっと読む

公式Twitter

@HatenaBookmark
リリース、障害情報などのサービスのお知らせ
@hatebu
最新の人気エントリーの配信

キーボードショートカット一覧

j次のブックマーク

k前のブックマーク

lあとで読む

eコメント一覧を開く

oページを開く

設定を変更しましたx