Stars
A generative world for general-purpose robotics & embodied AI learning.
Liquid: Language Models are Scalable Multi-modal Generators
Prompts for GPT-4V & DALL-E3 to fully utilize their multi-modal abilities. GPT4V Prompts, DALL-E3 Prompts.
An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) fo…
🔥🔥🔥 Latest papers, code, and datasets on Vid-LLMs.
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
[CVPR 2024] Official implementation of the paper "Visual In-context Learning"
The official repository of "Video assistant towards large language model makes everything easy"
Fast and memory-efficient exact attention
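A minimal sketch of the idea this repo implements (FlashAttention), shown via PyTorch's built-in `scaled_dot_product_attention` rather than the repo's own package: on supported GPUs, recent PyTorch versions dispatch this call to a fused FlashAttention-style kernel that computes exact attention without materializing the full attention matrix. Shapes and sizes below are illustrative assumptions.

```python
import torch
import torch.nn.functional as F

device = "cuda" if torch.cuda.is_available() else "cpu"
dtype = torch.float16 if device == "cuda" else torch.float32

# (batch, heads, seq_len, head_dim) layout expected by PyTorch SDPA.
q = torch.randn(2, 8, 1024, 64, device=device, dtype=dtype)
k = torch.randn_like(q)
v = torch.randn_like(q)

# Exact causal attention; on supported GPUs this dispatches to a fused
# FlashAttention-style kernel, avoiding the (seq_len x seq_len) matrix.
out = F.scaled_dot_product_attention(q, k, v, is_causal=True)
print(out.shape)  # torch.Size([2, 8, 1024, 64])
```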
[NeurIPS 2022] Zero-Shot Video Question Answering via Frozen Bidirectional Language Models
Official repository of the paper "VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding"
[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted fo…
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
[EMNLP 2024 🔥] Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
An Extensible Continual Learning Framework Focused on Language Models (LMs)
OpenEQA: Embodied Question Answering in the Era of Foundation Models
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
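A minimal, illustrative sketch of the projection step behind GaLore (not the official galore-torch API): the gradient is projected onto a low-rank subspace spanned by its top singular vectors, so optimizer state can live in that smaller space. The plain-SGD update and all names below are assumptions for brevity; the paper pairs the projection with Adam and refreshes the subspace periodically.

```python
import torch

def galore_sgd_step(W, G, lr=1e-3, rank=4):
    # Projector from the top-r left singular vectors of the gradient.
    U, _, _ = torch.linalg.svd(G, full_matrices=False)
    P = U[:, :rank]            # (m, r)
    R = P.T @ G                # low-rank projected gradient: (r, n)
    W -= lr * (P @ R)          # project back and apply the update
    return W

W = torch.randn(64, 32)
G = torch.randn(64, 32)        # stand-in for a real gradient
W = galore_sgd_step(W, G)
```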
PyTorch code and models for V-JEPA self-supervised learning from video.
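A toy sketch of a JEPA-style objective (a simplification, not the released V-JEPA code): a predictor regresses the latent representations of masked tokens, produced by an EMA target encoder, from the visible tokens, so no pixels are reconstructed. The linear encoders, pooling, masking pattern, and momentum value are placeholders.

```python
import copy
import torch
import torch.nn as nn
import torch.nn.functional as F

dim, n_tokens = 128, 16
encoder = nn.Linear(dim, dim)             # stand-in for a video transformer
target_encoder = copy.deepcopy(encoder)   # EMA copy, never backpropagated
for p in target_encoder.parameters():
    p.requires_grad_(False)
predictor = nn.Linear(dim, dim)

x = torch.randn(4, n_tokens, dim)         # patchified video clip
mask = torch.zeros(n_tokens, dtype=torch.bool)
mask[::2] = True                          # hide every other token (illustrative)

ctx = encoder(x[:, ~mask])                # encode visible tokens only
with torch.no_grad():
    tgt = target_encoder(x[:, mask])      # latent targets for hidden tokens

# Crude pooled prediction in place of a real cross-attention predictor.
pred = predictor(ctx).mean(dim=1, keepdim=True)
loss = F.l1_loss(pred.expand_as(tgt), tgt)
loss.backward()

# EMA update of the target encoder (momentum is an assumption).
m = 0.99
with torch.no_grad():
    for pt, pc in zip(target_encoder.parameters(), encoder.parameters()):
        pt.mul_(m).add_(pc, alpha=1 - m)
```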
Official Repository for our ICCV 2021 paper: Continual Learning on Noisy Data Streams via Self-Purified Replay
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.
[ICLR'23 Oral] Universal Few-shot Learning of Dense Prediction Tasks with Visual Token Matching
An Awesome Collection of Urban Foundation Models (UFMs).
Code repository for IMU2CLIP (https://arxiv.org/pdf/2210.14395.pdf)
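A minimal sketch of the contrastive alignment idea behind IMU2CLIP (a simplification, not the repo's code): an IMU encoder is trained so its embeddings land next to the frozen CLIP embeddings of the time-aligned video clips, via a symmetric InfoNCE loss. The toy encoder, window size, and temperature are assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

imu_encoder = nn.Sequential(nn.Flatten(), nn.Linear(6 * 200, 512))  # toy encoder

imu = torch.randn(8, 6, 200)       # batch of 6-axis IMU windows
clip_emb = torch.randn(8, 512)     # frozen CLIP embeddings of paired clips

z_imu = F.normalize(imu_encoder(imu), dim=-1)
z_clip = F.normalize(clip_emb, dim=-1)

# Symmetric InfoNCE: matched (IMU, clip) pairs sit on the diagonal.
logits = z_imu @ z_clip.T / 0.07   # temperature 0.07 is an assumption
labels = torch.arange(8)
loss = (F.cross_entropy(logits, labels) +
        F.cross_entropy(logits.T, labels)) / 2
loss.backward()
```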
OpenXAI: Towards a Transparent Evaluation of Model Explanations
The champion solution for Ego4D Natural Language Queries Challenge in CVPR 2023
EILeV: Eliciting In-Context Learning in Vision-Language Models for Videos Through Curated Data Distributional Properties