Skip to content
View Ruiyang-061X's full-sized avatar
🤗
Focusing
🤗
Focusing

Block or report Ruiyang-061X

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

AutoHallusion Codebase (EMNLP 2024)

Jupyter Notebook 17 1 Updated Dec 6, 2024

[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models

Python 268 7 Updated Nov 13, 2024

[ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning

Python 267 13 Updated Mar 13, 2024

Code for "AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling"

Python 821 66 Updated Aug 27, 2024

An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents

Python 5,424 432 Updated Sep 26, 2024

[AAAI 2024] Official PyTorch Implementation of ''BAT: Behavior-Aware Human-Like Trajectory Prediction for Autonomous Driving''.

Python 88 8 Updated Nov 13, 2024

✨A curated list of papers on the uncertainty in multi-modal large language model (MLLM).

27 Updated Jan 13, 2025

[ACL 2024] Logical Closed Loop: Uncovering Object Hallucinations in Large Vision-Language Models. Detect and mitigate object hallucinations in LVLMs by itself through logical closed loops.

Python 19 1 Updated Jun 20, 2024

Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language Models

Python 72 4 Updated Sep 15, 2024

Benchmarking LLMs via Uncertainty Quantification

Python 204 9 Updated Jan 30, 2024
Python 18 Updated Feb 28, 2024

VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs

Python 1,037 68 Updated Jan 23, 2025

[ACL2023 Area Chair Award] Official repo for the paper "Tell2Design: A Dataset for Language-Guided Floor Plan Generation".

Python 58 4 Updated Oct 20, 2023

[CVPR 2024 Highlight] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding

Python 231 13 Updated Oct 7, 2024

✨✨The Curse of Multi-Modalities (CMM): Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio

Python 38 2 Updated Oct 17, 2024

Code and data of "Unified Triplet-Level Hallucination Evaluation for Large Vision-Language Models".

Python 4 1 Updated Nov 9, 2024

😎 Awesome lists about all kinds of interesting topics

343,320 28,344 Updated Dec 12, 2024

🚀 [NeurIPS24] Make Vision Matter in Visual-Question-Answering (VQA)! Introducing NaturalBench, a vision-centric VQA benchmark (NeurIPS'24) that challenges vision-language models with simple questio…

Python 62 9 Updated Jan 23, 2025

Auto Interpretation Pipeline and many other functionalities for Multimodal SAE Analysis.

Python 100 5 Updated Jan 24, 2025

up-to-date curated list of state-of-the-art Large vision language models hallucinations research work, papers & resources

77 4 Updated Jan 9, 2025

🔎Official code for our paper: "VL-Uncertainty: Detecting Hallucination in Large Vision-Language Model via Uncertainty Estimation".

Python 14 2 Updated Dec 19, 2024

Image augmentation for machine learning experiments.

Python 14,497 2,451 Updated Jul 30, 2024

[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

Python 5,803 377 Updated Mar 14, 2024

PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838

Python 1,251 68 Updated Sep 27, 2024
Python 6 1 Updated Aug 28, 2024

[ICLR24] Official implementation of the paper “MagicDrive: Street View Generation with Diverse 3D Geometry Control”

Python 812 45 Updated Dec 9, 2024

An open-source framework for training large multimodal models.

Python 3,804 292 Updated Aug 31, 2024

[ECCV 2024] ShareGPT4V: Improving Large Multi-modal Models with Better Captions

Python 191 5 Updated Jul 1, 2024

[NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning

Python 157 8 Updated Dec 3, 2024

List of papers on hallucination detection in LLMs.

747 60 Updated Dec 19, 2024
Next
Showing results