2023 Papers

Scaling Vision Transformers to 22 Billion Parameters

2023 • 526 citations

LLaMA-Adapter V2: Parameter-Efficient Visual Instruction Model

2023 • 525 citations

ShareGPT4V: Improving Large Multi-Modal Models with Better Captions

2023 • 523 citations

Fantasia3D: Disentangling Geometry and Appearance for High-quality Text-to-3D Content Creation

2023 • 520 citations

Can Large Language Models Be an Alternative to Human Evaluation?

2023 • 515 citations

Video-ChatGPT: Towards Detailed Video Understanding via Large Vision and Language Models

2023 • 515 citations

Graph of Thoughts: Solving Elaborate Problems with Large Language Models

2023 • 515 citations

Jailbreaking Black Box Large Language Models in Twenty Queries

2023 • 511 citations

Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

2023 • 503 citations

4D Gaussian Splatting for Real-Time Dynamic Scene Rendering

2023 • 501 citations

Muse: Text-To-Image Generation via Masked Generative Transformers

2023 • 498 citations

AlpacaFarm: A Simulation Framework for Methods that Learn from Human Feedback

2023 • 498 citations

GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints

2023 • 496 citations

K-Planes: Explicit Radiance Fields in Space, Time, and Appearance

2023 • 495 citations

A Survey on Multimodal Large Language Models

2023 • 494 citations

Mastering Diverse Domains through World Models

2023 • 493 citations

RWKV: Reinventing RNNs for the Transformer Era

2023 • 492 citations

Large Language Models Can Be Easily Distracted by Irrelevant Context

2023 • 489 citations

ChatGPT: Jack of all trades, master of none

2023 • 489 citations

In-Context Retrieval-Augmented Language Models

2023 • 489 citations

Large Language Models

2023 • 486 citations

VideoChat: Chat-Centric Video Understanding

2023 • 485 citations

Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators

2023 • 484 citations

Otter: A Multi-Modal Model with In-Context Instruction Tuning

2023 • 483 citations

Instruction Tuning for Large Language Models: A Survey

2023 • 478 citations

A General Theoretical Paradigm to Understand Learning from Human Preferences

2023 • 474 citations

Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes

2023 • 472 citations

MasaCtrl: Tuning-Free Mutual Self-Attention Control for Consistent Image Synthesis and Editing

2023 • 471 citations

Structure and Content-Guided Video Synthesis with Diffusion Models

2023 • 470 citations

Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models

2023 • 469 citations

ChatGPT and a New Academic Reality: AI-Written Research Papers and the Ethics of the Large Language Models in Scholarly Publishing

2023 • 468 citations

Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language Models

2023 • 467 citations

Large Language Models are not Fair Evaluators

2023 • 464 citations

C-EVAL: A Multi-Level Multi-Discipline Chinese Evaluation Suite for Foundation Models

2023 • 462 citations

Extending Context Window of Large Language Models via Position Interpolation

2023 • 459 citations

SEED-Bench: Benchmarking Multimodal LLMs with Generative Comprehension

2023 • 458 citations

AudioLDM: Text-to-Audio Generation with Latent Diffusion Models

2023 • 453 citations

Reasoning with Language Model is Planning with World Model

2023 • 451 citations

Zip-NeRF: Anti-Aliased Grid-Based Neural Radiance Fields

2023 • 447 citations

A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPT

2023 • 444 citations

Enhancing Chat Language Models by Scaling High-quality Instructional Conversations

2023 • 443 citations

AGIEval: A Human-Centric Benchmark for Evaluating Foundation Models

2023 • 442 citations

Inference-Time Intervention: Eliciting Truthful Answers from a Language Model

2023 • 439 citations

A Comprehensive Overview of Large Language Models

2023 • 439 citations

VoxPoser: Composable 3D Value Maps for Robotic Manipulation with Language Models

2023 • 438 citations

MathVista: Evaluating Mathematical Reasoning of Foundation Models in Visual Contexts

2023 • 434 citations

ZoeDepth: Zero-shot Transfer by Combining Relative and Metric Depth

2023 • 432 citations

A Comprehensive AI Policy Education Framework for University Teaching and Learning

2023 • 431 citations

LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding

2023 • 431 citations

Segment Anything Model for Medical Image Analysis: an Experimental Study

2023 • 430 citations

ChatGPT for Robotics: Design Principles and Model Abilities

2023 • 429 citations

Benchmarking Large Language Models for News Summarization

2023 • 429 citations

Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback

2023 • 426 citations

Summary of ChatGPT-Related Research and Perspective Towards the Future of Large Language Models

2023 • 425 citations

Scaling up GANs for Text-to-Image Synthesis

2023 • 420 citations

CogVLM: Visual Expert for Pretrained Language Models

2023 • 419 citations

A Watermark for Large Language Models

2023 • 417 citations

MiniGPT-v2: Large Language Model as a Unified Interface for Vision-Language Multi-Task Learning

2023 • 417 citations

BiFormer: Vision Transformer with Bi-Level Routing Attention

2023 • 417 citations

Segment Everything Everywhere All at Once

2023 • 416 citations

VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks

2023 • 414 citations

EVA-CLIP: Improved Training Techniques for CLIP at Scale

2023 • 414 citations

CodeT5+: Open Code Large Language Models for Code Understanding and Generation

2023 • 413 citations

Medical SAM Adapter: Adapting Segment Anything Model for Medical Image Segmentation

2023 • 409 citations

Is ChatGPT a Good NLG Evaluator? A Preliminary Study

2023 • 409 citations

NExT-GPT: Any-to-Any Multimodal LLM

2023 • 409 citations

Bias and Fairness in Large Language Models: A Survey

2023 • 407 citations

A Survey on Large Language Model (LLM) Security and Privacy: The Good, the Bad, and the Ugly

2023 • 404 citations

Jailbreaking ChatGPT via Prompt Engineering: An Empirical Study

2023 • 403 citations

Zero-shot Image-to-Image Translation

2023 • 402 citations

Dynamic 3D Gaussians: Tracking by Persistent Dynamic View Synthesis

2023 • 402 citations

Textbooks Are All You Need II: phi-1.5 technical report

2023 • 400 citations

AWQ: Activation-Aware Weight Quantization for On-Device LLM Compression and Acceleration

2023 • 399 citations

ViperGPT: Visual Inference via Python Execution for Reasoning

2023 • 399 citations

Vision-Language Models for Vision Tasks: A Survey

2023 • 394 citations

One-2-3-45: Any Single Image to 3D Mesh in 45 Seconds without Per-Shape Optimization

2023 • 393 citations

MusicLM: Generating Music From Text

2023 • 391 citations

ChatEval: Towards Better LLM-Based Evaluators through Multi-Agent Debate

2023 • 390 citations

HexPlane: A Fast Representation for Dynamic Scenes

2023 • 387 citations

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

2023 • 384 citations

WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct

2023 • 384 citations

Efficient Multi-Scale Attention Module with Cross-Spatial Learning

2023 • 383 citations

How Good Are GPT Models at Machine Translation? A Comprehensive Evaluation

2023 • 380 citations

OpenFlamingo: An Open-Source Framework for Training Large Autoregressive Vision-Language Models

2023 • 375 citations

SWE-bench: Can Language Models Resolve Real-World GitHub Issues?

2023 • 374 citations

SyncDreamer: Generating Multiview-Consistent Images from a Single-View Image

2023 • 372 citations

Not what you've signed up for: Compromising Real-World LLM-Integrated Applications with Indirect Prompt Injection

2023 • 370 citations

Large Language Models Cannot Self-Correct Reasoning Yet

2023 • 370 citations

RAFT: Reward rAnked FineTuning for Generative Foundation Model Alignment

2023 • 369 citations

Are Emergent Abilities of Large Language Models a Mirage?

2023 • 367 citations

How Is ChatGPT's Behavior Changing over Time?

2023 • 362 citations

GPTs are GPTs: An Early Look at the Labor Market Impact Potential of Large Language Models

2023 • 361 citations

BeaverTails: Towards Improved Safety Alignment of LLM via a Human-Preference Dataset

2023 • 361 citations

LRM: Large Reconstruction Model for Single Image to 3D

2023 • 358 citations

Multimodal Chain-of-Thought Reasoning in Language Models

2023 • 357 citations

Segment Anything in Medical Images

2023 • 356 citations

SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models

2023 • 356 citations

iTransformer: Inverted Transformers Are Effective for Time Series Forecasting

2023 • 354 citations

mPLUG-Owl2: Revolutionizing Multi-modal Large Language Model with Modality Collaboration

2023 • 353 citations

ModelScope Text-to-Video Technical Report

2023 • 351 citations
