Quantization is a technique to reduce the computational and memory costs of running deep learning models by representing their weights and activations with low-precision data types, such as 8-bit integer (int8), instead of the usual 32-bit floating point (float32). Using fewer bits means the resulting model requires less memory to store, which is crucial for deploying large language models.
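To make the idea concrete, here is a minimal, dependency-free sketch of absmax int8 quantization (an illustrative toy, not the exact scheme any particular library uses):

```python
def quantize_int8(weights):
    """Scale float weights into the int8 range [-127, 127] via the absolute maximum."""
    scale = max(abs(w) for w in weights) / 127.0
    return [round(w / scale) for w in weights], scale

def dequantize_int8(quantized, scale):
    """Recover approximate float weights from int8 values and the stored scale."""
    return [q * scale for q in quantized]

weights = [0.5, -1.2, 0.03, 2.4]
quantized, scale = quantize_int8(weights)
restored = dequantize_int8(quantized, scale)
# Each value now fits in 1 byte instead of 4; the rounding error is bounded by ~scale/2.
```

The 4x storage saving comes at the cost of a small, bounded approximation error per weight.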
We find fewer than 4 contaminated samples for MMLU, OpenBookQA, and WinoGrande.

Training stack

We trained a 1B LLM using the Llama2 architecture on Cosmopedia to assess its quality: https://huggingface.co/HuggingFaceTB/cosmo-1b. We used the datatrove library for data deduplication and tokenization, nanotron for model training, and lighteval for evaluation. The model performs better than TinyLlama 1.1B on ARC.
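As a rough illustration of how such contamination checks work (the function names and n-gram size here are assumptions for the sketch, not datatrove's actual implementation), a training document can be flagged when it shares a long token n-gram with an evaluation sample:

```python
def ngrams(text, n):
    """Set of lowercase token n-grams occurring in a text."""
    tokens = text.lower().split()
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}

def is_contaminated(train_doc, eval_sample, n=10):
    """Flag a training document sharing any token n-gram with an eval sample."""
    return bool(ngrams(train_doc, n) & ngrams(eval_sample, n))
```

Documents flagged this way would be dropped from the training set before training starts.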
In this blog post, we will introduce you to the concept of Matryoshka Embeddings and explain why they are useful. We will discuss how these models are theoretically trained and how you can train them using Sentence Transformers. Additionally, we will provide practical guidance on how to use Matryoshka Embedding models and share a comparison between a Matryoshka embedding model and a regular embedding model.
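The core trick at inference time is that a Matryoshka embedding can simply be truncated to a smaller dimensionality and re-normalized. A minimal pure-Python sketch (illustrative only, using a tiny made-up vector):

```python
import math

def truncate_embedding(vector, dim):
    """Keep the first `dim` values of a Matryoshka embedding, re-normalized to unit length."""
    sub = vector[:dim]
    norm = math.sqrt(sum(x * x for x in sub))
    return [x / norm for x in sub]

embedding = [0.6, 0.8, 0.05, -0.02]       # imagine a full-size model output
small = truncate_embedding(embedding, 2)  # half the storage per vector
```

Because Matryoshka models are trained so that the leading dimensions carry the most information, the truncated vector remains useful for similarity search at a fraction of the storage cost.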
Recently an interesting discussion arose on Twitter following the release of Falcon 🦅 and its addition to the Open LLM Leaderboard, a public leaderboard comparing open-access large language models. The discussion centered around one of the four evaluations displayed on the leaderboard: a benchmark for measuring Massive Multitask Language Understanding (shortname: MMLU). The community was surprised by the results.
This guide introduces BLIP-2 from Salesforce Research, which enables a suite of state-of-the-art visual language models that are now available in 🤗 Transformers. We'll show you how to use it for image captioning, prompted image captioning, visual question-answering, and chat-based prompting.

Table of contents
- Introduction
- What's under the hood in BLIP-2?
- Using BLIP-2 with Hugging Face Transformers
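As a taste of what the Transformers usage looks like, here is a hedged sketch of plain image captioning with BLIP-2 (the checkpoint and image URL are just examples; a CUDA GPU is assumed for float16):

```python
import requests
import torch
from PIL import Image
from transformers import Blip2Processor, Blip2ForConditionalGeneration

processor = Blip2Processor.from_pretrained("Salesforce/blip2-opt-2.7b")
model = Blip2ForConditionalGeneration.from_pretrained(
    "Salesforce/blip2-opt-2.7b", torch_dtype=torch.float16
)
device = "cuda" if torch.cuda.is_available() else "cpu"
model.to(device)

url = "http://images.cocodataset.org/val2017/000000039769.jpg"  # example image
image = Image.open(requests.get(url, stream=True).raw)

# Plain image captioning: pass only the image, no text prompt.
inputs = processor(images=image, return_tensors="pt").to(device, torch.float16)
generated_ids = model.generate(**inputs)
caption = processor.batch_decode(generated_ids, skip_special_tokens=True)[0].strip()
print(caption)
```

For prompted captioning or visual question-answering, a text prompt (for example in the form "Question: ... Answer:") is passed to the processor alongside the image.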
To reproduce the benchmark results, simply add --benchmark to any of the 3 scripts discussed below.

Solutions

First, check out the demo repository:

git clone https://github.com/huggingface/transformers-bloom-inference
cd transformers-bloom-inference

In this article we are going to use 3 scripts located under bloom-inference-scripts/. The framework-specific solutions are presented in alphabetical order.
The 3 models are BLOOM-176B, T5-11B and T5-3B.

Hugging Face transformers integration nuances

Next let's discuss the specifics of the Hugging Face transformers integration. Let's look at the usage and the common culprit you may encounter while trying to set things up.

Usage

The module responsible for the whole magic described in this blog post is called Linear8bitLt, and you can easily import it from the bitsandbytes library.
Please note that both Megatron-LM and DeepSpeed have Pipeline Parallelism and BF16 Optimizer implementations, but we used the ones from DeepSpeed as they are integrated with ZeRO. Megatron-DeepSpeed implements 3D Parallelism to allow huge models to train in a very efficient way. Let's briefly discuss the 3D components. DataParallel (DP) - the same setup is replicated multiple times, and each replica is fed a slice of the data.
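To see what DP means in miniature, here is a toy, framework-free sketch: identical replicas each compute gradients on their own data shard, then the gradients are averaged (the all-reduce step) so that every replica applies the same update and stays in sync:

```python
def grad(w, batch):
    """Gradient of mean squared error for the model y = w * x on (x, y) pairs."""
    return sum(2 * (w * x - y) * x for x, y in batch) / len(batch)

w = 0.0                                 # every replica starts from the same weights
shards = [[(1.0, 2.0)], [(2.0, 4.0)]]   # each replica is fed its own data shard
local_grads = [grad(w, shard) for shard in shards]
avg = sum(local_grads) / len(local_grads)  # the all-reduce: average across replicas
w -= 0.1 * avg                          # identical update on every replica
```

Real frameworks do exactly this per step, just with tensors and a collective-communication library instead of Python lists.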