2023 Papers
Machine learning and AI research papers from 2023
Scaling Vision Transformers to 22 Billion Parameters
2023
•
526 citations
LLaMA-Adapter V2: Parameter-Efficient Visual Instruction Model
2023
•
525 citations
ShareGPT4V: Improving Large Multi-Modal Models with Better Captions
2023
•
523 citations
Fantasia3D: Disentangling Geometry and Appearance for High-quality Text-to-3D Content Creation
2023
•
520 citations
Can Large Language Models Be an Alternative to Human Evaluation?
2023
•
515 citations
Video-ChatGPT: Towards Detailed Video Understanding via Large Vision and Language Models
2023
•
515 citations
Graph of Thoughts: Solving Elaborate Problems with Large Language Models
2023
•
515 citations
Jailbreaking Black Box Large Language Models in Twenty Queries
2023
•
511 citations
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
2023
•
503 citations
4D Gaussian Splatting for Real-Time Dynamic Scene Rendering
2023
•
501 citations
Muse: Text-To-Image Generation via Masked Generative Transformers
2023
•
498 citations
AlpacaFarm: A Simulation Framework for Methods that Learn from Human Feedback
2023
•
498 citations
GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints
2023
•
496 citations
K-Planes: Explicit Radiance Fields in Space, Time, and Appearance
2023
•
495 citations
A survey on multimodal large language models
2023
•
494 citations
Mastering Diverse Domains through World Models
2023
•
493 citations
RWKV: Reinventing RNNs for the Transformer Era
2023
•
492 citations
Large Language Models Can Be Easily Distracted by Irrelevant Context
2023
•
489 citations
ChatGPT: Jack of all trades, master of none
2023
•
489 citations
In-Context Retrieval-Augmented Language Models
2023
•
489 citations
Large Language Models
2023
•
486 citations
VideoChat: Chat-Centric Video Understanding
2023
•
485 citations
Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators
2023
•
484 citations
Otter: A Multi-Modal Model with In-Context Instruction Tuning
2023
•
483 citations
Instruction Tuning for Large Language Models: A Survey
2023
•
478 citations
A General Theoretical Paradigm to Understand Learning from Human Preferences
2023
•
474 citations
Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes
2023
•
472 citations
MasaCtrl: Tuning-Free Mutual Self-Attention Control for Consistent Image Synthesis and Editing
2023
•
471 citations
Structure and Content-Guided Video Synthesis with Diffusion Models
2023
•
470 citations
Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models
2023
•
469 citations
ChatGPT and a New Academic Reality: AI-Written Research Papers and the Ethics of the Large Language Models in Scholarly Publishing
2023
•
468 citations
Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language Models
2023
•
467 citations
Large Language Models are not Fair Evaluators
2023
•
464 citations
C-EVAL: A Multi-Level Multi-Discipline Chinese Evaluation Suite for Foundation Models
2023
•
462 citations
EXTENDING CONTEXT WINDOW OF LARGE LANGUAGE MODELS VIA POSITION INTERPOLATION
2023
•
459 citations
SEED-Bench: Benchmarking Multimodal LLMs with Generative Comprehension
2023
•
458 citations
AudioLDM: Text-to-Audio Generation with Latent Diffusion Models
2023
•
453 citations
Reasoning with Language Model is Planning with World Model
2023
•
451 citations
Zip-NeRF: Anti-Aliased Grid-Based Neural Radiance Fields
2023
•
447 citations
A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPT
2023
•
444 citations
Enhancing Chat Language Models by Scaling High-quality Instructional Conversations
2023
•
443 citations
AGIEval: A Human-Centric Benchmark for Evaluating Foundation Models
2023
•
442 citations
Inference-Time Intervention: Eliciting Truthful Answers from a Language Model
2023
•
439 citations
A Comprehensive Overview of Large Language Models
2023
•
439 citations
VoxPoser: Composable 3D Value Maps for Robotic Manipulation with Language Models
2023
•
438 citations
MATHVISTA: EVALUATING MATHEMATICAL REASONING OF FOUNDATION MODELS IN VISUAL CONTEXTS
2023
•
434 citations
ZoeDepth: Zero-shot Transfer by Combining Relative and Metric Depth
2023
•
432 citations
A Comprehensive AI Policy Education Framework for University Teaching and Learning
2023
•
431 citations
LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding
2023
•
431 citations
Segment Anything Model for Medical Image Analysis: an Experimental Study
2023
•
430 citations
ChatGPT for Robotics: Design Principles and Model Abilities
2023
•
429 citations
Benchmarking Large Language Models for News Summarization
2023
•
429 citations
Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
2023
•
426 citations
Summary of ChatGPT-Related Research and Perspective Towards the Future of Large Language Models
2023
•
425 citations
Scaling up GANs for Text-to-Image Synthesis
2023
•
420 citations
CogVLM: Visual Expert for Pretrained Language Models
2023
•
419 citations
A Watermark for Large Language Models
2023
•
417 citations
MINIGPT-V2: LARGE LANGUAGE MODEL AS A UNIFIED INTERFACE FOR VISION-LANGUAGE MULTI-TASK LEARNING
2023
•
417 citations
BiFormer: Vision Transformer with Bi-Level Routing Attention
2023
•
417 citations
Segment Everything Everywhere All at Once
2023
•
416 citations
VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks
2023
•
414 citations
EVA-CLIP: Improved Training Techniques for CLIP at Scale
2023
•
414 citations
CodeT5+: Open Code Large Language Models for Code Understanding and Generation
2023
•
413 citations
Medical SAM Adapter: Adapting Segment Anything Model for Medical Image Segmentation
2023
•
409 citations
Is ChatGPT a Good NLG Evaluator? A Preliminary Study
2023
•
409 citations
NExT-GPT: Any-to-Any Multimodal LLM
2023
•
409 citations
Bias and Fairness in Large Language Models: A Survey
2023
•
407 citations
A Survey on Large Language Model (LLM) Security and Privacy: The Good, the Bad, and the Ugly
2023
•
404 citations
Jailbreaking ChatGPT via Prompt Engineering: An Empirical Study
2023
•
403 citations
Zero-shot Image-to-Image Translation
2023
•
402 citations
Dynamic 3D Gaussians: Tracking by Persistent Dynamic View Synthesis
2023
•
402 citations
Textbooks Are All You Need II: phi-1.5 technical report
2023
•
400 citations
AWQ: ACTIVATION-AWARE WEIGHT QUANTIZATION FOR ON-DEVICE LLM COMPRESSION AND ACCELERATION
2023
•
399 citations
ViperGPT: Visual Inference via Python Execution for Reasoning
2023
•
399 citations
Vision-Language Models for Vision Tasks: A Survey
2023
•
394 citations
One-2-3-45: Any Single Image to 3D Mesh in 45 Seconds without Per-Shape Optimization
2023
•
393 citations
MusicLM: Generating Music From Text
2023
•
391 citations
CHATEVAL: TOWARDS BETTER LLM-BASED EVALUATORS THROUGH MULTI-AGENT DEBATE
2023
•
390 citations
HexPlane: A Fast Representation for Dynamic Scenes
2023
•
387 citations
LATENT CONSISTENCY MODELS: SYNTHESIZING HIGH-RESOLUTION IMAGES WITH FEW-STEP INFERENCE
2023
•
384 citations
WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct
2023
•
384 citations
Efficient Multi-Scale Attention Module with Cross-Spatial Learning
2023
•
383 citations
How Good Are GPT Models at Machine Translation? A Comprehensive Evaluation
2023
•
380 citations
OpenFlamingo: An Open-Source Framework for Training Large Autoregressive Vision-Language Models
2023
•
375 citations
SWE-BENCH: CAN LANGUAGE MODELS RESOLVE REAL-WORLD GITHUB ISSUES?
2023
•
374 citations
SYNCDREAMER: GENERATING MULTIVIEW-CONSISTENT IMAGES FROM A SINGLE-VIEW IMAGE
2023
•
372 citations
Not what you've signed up for: Compromising Real-World LLM-Integrated Applications with Indirect Prompt Injection
2023
•
370 citations
LARGE LANGUAGE MODELS CANNOT SELF-CORRECT REASONING YET
2023
•
370 citations
RAFT: Reward rAnked FineTuning for Generative Foundation Model Alignment
2023
•
369 citations
Are Emergent Abilities of Large Language Models a Mirage?
2023
•
367 citations
How Is ChatGPT's Behavior Changing over Time?
2023
•
362 citations
GPTs are GPTs: An Early Look at the Labor Market Impact Potential of Large Language Models
2023
•
361 citations
BEAVERTAILS: Towards Improved Safety Alignment of LLM via a Human-Preference Dataset
2023
•
361 citations
LRM: LARGE RECONSTRUCTION MODEL FOR SINGLE IMAGE TO 3D
2023
•
358 citations
Multimodal Chain-of-Thought Reasoning in Language Models
2023
•
357 citations
Segment Anything in Medical Images
2023
•
356 citations
SELFCHECKGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models
2023
•
356 citations
ITRANSFORMER: INVERTED TRANSFORMERS ARE EFFECTIVE FOR TIME SERIES FORECASTING
2023
•
354 citations
mPLUG-Owl2: Revolutionizing Multi-modal Large Language Model with Modality Collaboration
2023
•
353 citations
ModelScope Text-to-Video Technical Report
2023
•
351 citations