원문보기 내부링크

Tistory

<Multi-modal> FLAVA: A Foundational Language And Vision Alignment Model

예전(2021.12)에 나온 논문을 읽어보고 간단히 정리했습니다. 혹시 부족하거나 잘못된 내용이 있다면 댓글 부탁드립니다 ️ usechatgpt init success [Facebook AI Research (FAIR)] 여러 modality를 '한 번에' 처리할 수 있는 foundation 모델 FLAVA. vision, language, cross/multi-modal vision-langue task 전부 처리. 배경 그렇게 오래 전도 아니지만 이때만 하더라도 multi-modal 모델들의 성능은 지금과 사뭇 달랐던 것 같습니다. 본 논문에서 지적하고 있는 기존 모델들의 한계는 결국 모델의 능력이 '특정 modality에 국한'되어 있다는 것입니다. 여러 modality를 동시에 잘 이해하고 ..

원문보기 내부링크

chanmuzi의 등록된 링크

유데미 강의 후기 【한글자막】 Docker & Kubernetes : 실전 가이드, 【한글자막】 랭체인 - LangChain 으로 LLM 기반 애플리케이션 개발하기

서울 상위권 대학 인공지능 대학원 컨택 후기 (2024년 가을학기 입학 목표)

[Upstage] 업스테이지 AI Solution Reliability Engineer 합격 후기 (정규직 전환형 인턴십) (비전공자)

&lt;Document&gt; LayoutLM: Pre-training of Text and Layout for Document Image Understanding (2019.12)

AI Expo Korea 2024 국제인공지능대전 후기

Welcome Llama 3 - Meta’s new open LLM (HuggingFace 블로그 Llama 3 - ChatGPT 한글 번역)

&lt;SLM&gt; Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone (2024.04)

&lt;PEFT&gt; QLoRA (2023.05) 논문 상세 리뷰 및 간단 구현 (with Gemma)

&lt;Embedding&gt; LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders (2024.04)

&lt;PEFT&gt; ResLoRA: Identity Residual Mapping in Low-Rank Adaption (2024.02)

&lt;Reward&gt; Beyond Sparse Rewards: Enhancing Reinforcement Learning with Language Model Critique in Text Generation (2024.01)

&lt;LLM&gt; Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models (2024.02)

[대학원생 필수!] 논문 관리 프로그램 Zotero 추천 (WebDAV 연결, iPad annotation 싱크 관리)

TOEIC/TEPS 동시 준비 2주 만에 955/409점 달성한 후기

&lt;Prompting, Decomposition&gt; Least-to-Most Prompting Enables Complex Reasoning in Large Language Models (2023.04)

&lt;Data Type&gt; [BitNet b1.58] The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits (2024.02)

&lt;CoT, Prompting&gt; [Google DeepMind] Chain-of-Thought Reasoning Without Prompting (2024.02)

&lt;LLM&gt; [Google DeepMind] Gemma: Open Models Based on GeminiResearch and Technology (2024.02)

&lt;Self&gt; SELF: Self-Evolution with Language Feedback (2023.10)

&lt;Decompose&gt; Self-Discover: Large Language Models Self-Compose Reasoning Structures (2024.02)

인공지능 최신 논문/뉴스 follow-up 꿀팁 대공개!! (NLP, LLM 위주 )

&lt;RAG, Refinement&gt; [CRAG] Corrective Retrieval Augmented Generation (2024.01)

GPT-4의 토큰별 예측 확률을 확인할 수 있을까? (부분적으로 가능하다!)

&lt;Benchmark, CoT&gt; [Google, REVEAL] A Chain-of-Thought Is as Strong as Its Weakest Link: A Benchmark for Verifiers of Reasoning Chains (2024.02)

&lt;RL, Fine-Tuning&gt; [ByteDance] ReFT - Reasoning with Reinforced Fine-Tuning (2024.01)

&lt;Pipeline, Rationale&gt; PINTO: Faithful Language Reasoning Using Prompt-Generated Rationales (2023.04)

&lt;Distillation, Decoding&gt; [Proxy-tuning] Tuning Language Models by Proxy (2024.01)

KT 2023년도 봄학기 AI 석사과정 신입생 모집 서류 합격 및 코딩 테스트/인적성 검사 후기(비전공자)

&lt;KD, Fusion&gt; Knowledge Fusion of Large Language Models (2024.01)

&lt;KD, Hallucination&gt; [Idk Dataset] Can AI Assistants Know What They Don't Know? (2024.01)

&lt;RLAIF, Self&gt; Self-Rewarding Language Models (2024.01)

&lt;Supervision&gt; [OpenAI] Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision (2023.12)

&lt;Attention&gt; [CALM] LLM Augmented LLMs: Expanding Capabilities through Composition (2024.01)

&lt;LLM, RNN&gt; Transformers are Multi-State RNNs (2024.01)

&lt;Agent, VLM&gt; CogAgent: A Visual Language Model for GUI Agents (2023.12)

&lt;LLM&gt; [MoE] Mixtral of Experts (2024.01)

&lt;NLP&gt; [Transformer] Attention Is All You Need (2017.06)

&lt;sLLM&gt; TinyLlama: An Open-Source Small Language Model (2024.01)

[밑바닥부터 시작하는 딥러닝] Softmax 함수, 클래스 구현하기 (미분 수식, + with Cross Entropy)

&lt;LLM&gt; SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling (2023.12)

&lt;KD, Reasoning&gt; [NAT] Turning Dust into Gold: Distilling Complex Reasoning Capabilities from LLMs by Leveraging Negative Data (2023.12)

&lt;DB, Agent&gt; [FunSearch] Mathematical discoveries from program search with largelanguage models (2023.12)

2023년 회고록: 성장하지 못한 낙동강 오리알 cc

&lt;CoT, Agent&gt; ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent (2023.12)

&lt;LoRA, MoE&gt; LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment (2023.12)

&lt;CoT&gt; [Self-Consistency] Self-Consistency Improves Chain of Thought Reasoning in Language Models (2022.03 -&gt; 2023.03)

&lt;RLHF&gt; Reinforced Self-Training (ReST) for Language Modeling (2023.08)

&lt;Retrieval, Knowledge Injection&gt; Fine-Tuning or Retrieval? Comparing Knowledge Injection in LLMs (2023.12)

&lt;CoT, Agent&gt; ReAct: Synergizing Reasoning and Acting in Language Models (2022.10 → 2023.03)

&lt;LLM, Code&gt; [OSS-Instruct] Magicoder: Source Code Is All You Need (2023.12)

SK Tech Summit 2023 Day 2 후기 (23.11.17 금)

&lt;Dataset, Instruction&gt; AlpaGasus: Training A Better Alpaca with Fewer Data (2023.07)

&lt;LLM, Zero-shot&gt; [T0] Multitask Prompted Training Enables Zero-Shot Task Generalization (2022.03)

&lt;Prompt, Agent&gt; [SPP] Unleashing Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration (2023.07)

&lt;Retrieval&gt; [GenRead] Generate rather than Retrieval: Large Language Models are Strong Context Generators (2023.01)

&lt;Retrieval&gt; [DSI] Transformer Memory as a Differentiable Search Index (2022.02)

&lt;LK Lab, CoT&gt; The CoT Collection: Improving Zero-shot and Few-shot Learning of Language MOdels via Chain-of-Thought Fine-Tuning (2023.10)

&lt;LK Lab, Retrieval&gt; [RoSPr] Efficiently Enhancing Zero-Shot Performance of Instruction Following Model via Retrieval of Soft Prompt (2023.10)

&lt;Retrieval&gt; [DPR] Dense Passage Retrieval for Open-Domain Question Answering (2020.04)

&lt;LK Lab, Retrieval&gt; [ToC] Tree of Clarifications: Answering Ambiguous Questions with Retrieval-Augmented Large Language Models (2023.10)

&lt;LK Lab, Instruction&gt; [RoE] Exploring the Benfits of Training Expert Language Models over Instruction Tuning (2023.02)

&lt;LK Lab, Multi-modal&gt; [SeViT] Semi-Parametric Video-Grounded Text Generation (2023.01)

&lt;LK Lab, Multi-modal&gt; [ZeroTA] Zeor-Shot Dense Video Captioning by Jointly Optimizing Text and Moment (2023.01)

&lt;LK Lab, Evaluation&gt; Prometheus: Inducing Fine-Grained Evaluation Capability in Language Models (2023.10)

&lt;LK Lab, Evaluation&gt; [FLASK] Fine-Grained Language Model Evaluation Based on Alignment Skill Sets (2023.10)

&lt;LK Lab, Evaluation&gt; Knowledge Unlearning for Mitigating Privacy Risks in Language Models (2022.12)

&lt;LK Lab, Alignment&gt; [ALMoST] Aligning Large Language Models through Synthetic Feedback (2023.10)

&lt;LK Lab, Instruction&gt; [Flipped Learning] Guess the Instructoin! Flipped Learning Makes Language Models Stronger Zero-Shot Learners (2023.06)

&lt;LK Lab, Retrieval&gt; [Np Decoding] Nonparametric Decoding for Generative Retrieval (2023.05)

&lt;LK Lab, Benchmark&gt; TemporalWiki: A Lifelong Benchmark for Training and Evaluating Ever-Evolving Language Models (2022.04)

&lt;LK Lab, Retrieval&gt; REPLUG: Retrieval-Augmented Black-Box Language Models (2023.01)

&lt;Retrieval&gt; [RAG] Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks (2021.04)

&lt;LK Lab, Retrieval&gt; [GMR] Generative Multi-hop Retrieval (2022.10)

10월 3주차 논문 요약: GQA, LLM, LLMLingua, LLeMA, ToRA

10월 4주차 논문 요약: Ask Again, BitNet, Self-RAG, Meta-CoT, AutoDan, NEFTune, VeRA, Atlas

&lt;Retrieval&gt; [Short Paper Review] Retrieval meets Long Context Large Language Models

10월 2주차 논문 요약: Space and Time, RA-DIT, Mistral 7B

&lt;Multi-modal&gt; [LLaVA-1.5] Improved Baselines with Visual Instruction Tuning

&lt;LLM&gt; [Analogical Prompting] Large Language Models as Analogical Reasoners

<Document> LayoutLM: Pre-training of Text and Layout for Document Image Understanding (2019.12)

<SLM> Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone (2024.04)

<PEFT> QLoRA (2023.05) 논문 상세 리뷰 및 간단 구현 (with Gemma)

<Embedding> LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders (2024.04)

<PEFT> ResLoRA: Identity Residual Mapping in Low-Rank Adaption (2024.02)

<Reward> Beyond Sparse Rewards: Enhancing Reinforcement Learning with Language Model Critique in Text Generation (2024.01)

<LLM> Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models (2024.02)

<Prompting, Decomposition> Least-to-Most Prompting Enables Complex Reasoning in Large Language Models (2023.04)

<Data Type> [BitNet b1.58] The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits (2024.02)

<CoT, Prompting> [Google DeepMind] Chain-of-Thought Reasoning Without Prompting (2024.02)

<LLM> [Google DeepMind] Gemma: Open Models Based on GeminiResearch and Technology (2024.02)

<Self> SELF: Self-Evolution with Language Feedback (2023.10)

<Decompose> Self-Discover: Large Language Models Self-Compose Reasoning Structures (2024.02)

<RAG, Refinement> [CRAG] Corrective Retrieval Augmented Generation (2024.01)

<Benchmark, CoT> [Google, REVEAL] A Chain-of-Thought Is as Strong as Its Weakest Link: A Benchmark for Verifiers of Reasoning Chains (2024.02)

<RL, Fine-Tuning> [ByteDance] ReFT - Reasoning with Reinforced Fine-Tuning (2024.01)

<Pipeline, Rationale> PINTO: Faithful Language Reasoning Using Prompt-Generated Rationales (2023.04)

<Distillation, Decoding> [Proxy-tuning] Tuning Language Models by Proxy (2024.01)

<KD, Fusion> Knowledge Fusion of Large Language Models (2024.01)

<KD, Hallucination> [Idk Dataset] Can AI Assistants Know What They Don't Know? (2024.01)

<RLAIF, Self> Self-Rewarding Language Models (2024.01)

<Supervision> [OpenAI] Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision (2023.12)

<Attention> [CALM] LLM Augmented LLMs: Expanding Capabilities through Composition (2024.01)

<LLM, RNN> Transformers are Multi-State RNNs (2024.01)

<Agent, VLM> CogAgent: A Visual Language Model for GUI Agents (2023.12)

<LLM> [MoE] Mixtral of Experts (2024.01)

<NLP> [Transformer] Attention Is All You Need (2017.06)

<sLLM> TinyLlama: An Open-Source Small Language Model (2024.01)

<LLM> SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling (2023.12)

<KD, Reasoning> [NAT] Turning Dust into Gold: Distilling Complex Reasoning Capabilities from LLMs by Leveraging Negative Data (2023.12)

<DB, Agent> [FunSearch] Mathematical discoveries from program search with largelanguage models (2023.12)

<CoT, Agent> ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent (2023.12)

<LoRA, MoE> LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment (2023.12)

<CoT> [Self-Consistency] Self-Consistency Improves Chain of Thought Reasoning in Language Models (2022.03 -> 2023.03)

<RLHF> Reinforced Self-Training (ReST) for Language Modeling (2023.08)

<Retrieval, Knowledge Injection> Fine-Tuning or Retrieval? Comparing Knowledge Injection in LLMs (2023.12)

<CoT, Agent> ReAct: Synergizing Reasoning and Acting in Language Models (2022.10 → 2023.03)

<LLM, Code> [OSS-Instruct] Magicoder: Source Code Is All You Need (2023.12)

<Dataset, Instruction> AlpaGasus: Training A Better Alpaca with Fewer Data (2023.07)

<LLM, Zero-shot> [T0] Multitask Prompted Training Enables Zero-Shot Task Generalization (2022.03)

<Prompt, Agent> [SPP] Unleashing Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration (2023.07)

<Retrieval> [GenRead] Generate rather than Retrieval: Large Language Models are Strong Context Generators (2023.01)

<Retrieval> [DSI] Transformer Memory as a Differentiable Search Index (2022.02)

<LK Lab, CoT> The CoT Collection: Improving Zero-shot and Few-shot Learning of Language MOdels via Chain-of-Thought Fine-Tuning (2023.10)

<LK Lab, Retrieval> [RoSPr] Efficiently Enhancing Zero-Shot Performance of Instruction Following Model via Retrieval of Soft Prompt (2023.10)

<Retrieval> [DPR] Dense Passage Retrieval for Open-Domain Question Answering (2020.04)

<LK Lab, Retrieval> [ToC] Tree of Clarifications: Answering Ambiguous Questions with Retrieval-Augmented Large Language Models (2023.10)

<LK Lab, Instruction> [RoE] Exploring the Benfits of Training Expert Language Models over Instruction Tuning (2023.02)

<LK Lab, Multi-modal> [SeViT] Semi-Parametric Video-Grounded Text Generation (2023.01)

<LK Lab, Multi-modal> [ZeroTA] Zeor-Shot Dense Video Captioning by Jointly Optimizing Text and Moment (2023.01)

<LK Lab, Evaluation> Prometheus: Inducing Fine-Grained Evaluation Capability in Language Models (2023.10)

<LK Lab, Evaluation> [FLASK] Fine-Grained Language Model Evaluation Based on Alignment Skill Sets (2023.10)

<LK Lab, Evaluation> Knowledge Unlearning for Mitigating Privacy Risks in Language Models (2022.12)

<LK Lab, Alignment> [ALMoST] Aligning Large Language Models through Synthetic Feedback (2023.10)

<LK Lab, Instruction> [Flipped Learning] Guess the Instructoin! Flipped Learning Makes Language Models Stronger Zero-Shot Learners (2023.06)

<LK Lab, Retrieval> [Np Decoding] Nonparametric Decoding for Generative Retrieval (2023.05)

<LK Lab, Benchmark> TemporalWiki: A Lifelong Benchmark for Training and Evaluating Ever-Evolving Language Models (2022.04)

<LK Lab, Retrieval> REPLUG: Retrieval-Augmented Black-Box Language Models (2023.01)

<Retrieval> [RAG] Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks (2021.04)

<LK Lab, Retrieval> [GMR] Generative Multi-hop Retrieval (2022.10)

<Retrieval> [Short Paper Review] Retrieval meets Long Context Large Language Models

<Multi-modal> [LLaVA-1.5] Improved Baselines with Visual Instruction Tuning

<LLM> [Analogical Prompting] Large Language Models as Analogical Reasoners

<LLM> [Short Paper Review] Can large language models provide useful feedback...

<Attention> [Attention Sinks] Efficient Streaming Language Models with Attention Sinks

<Benchmark> [MMHAL-BENCH] Aligning Large Multimodal Models with Factually Augmented RLHF

<LLM> The Reversal Curse:LLMs trained on “A is B” fail to learn “B is A”

<Attention> LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models

<LLM> [Qwen] Qwen Technical Report

<Benchmark> Struc-Bench: Are Large Language Models Really Good at Generating Complex Structured Data?

<CoT> [CoVe] Chain-of-Verification Reduces Hallucination in Large Language Models

<Evaluation> RAIN: Your Language Models Can Align Themselves without Finetuning

<Image> [ViT] An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

<Agent> Agents: An Open-source Framework for Autonomous Language Agents

<Prompting> [OPRO] Large Language Models as Optimizers

<LM> DeBERTa: Decoding-enhanced BERT with Disentangled Attention

<LM> DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing

<Instruction> WizardCoder: Empowering Code Large Language Models with Evol-Instruct

<Alignment> RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback

<Multi-modal> PointLLM: Empowering Large Language Models to Understand Point Clouds