Skip to content
View StOnEGiggity's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report StOnEGiggity

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A generative world for general-purpose robotics & embodied AI learning.

Python 23,220 1,940 Updated Jan 22, 2025
Python 458 27 Updated Jan 20, 2025

Liquid: Language Models are Scalable Multi-modal Generators

60 Updated Dec 12, 2024

Prompts of GPT-4V & DALL-E3 to full utilize the multi-modal ability. GPT4V Prompts, DALL-E3 Prompts.

237 16 Updated Oct 25, 2023

An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) fo…

Jupyter Notebook 2,869 343 Updated Apr 25, 2024

🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.

1,845 89 Updated Jan 15, 2025

DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.

Python 1,705 119 Updated Dec 6, 2024

[CVPR 2024] Official implementation of the paper "Visual In-context Learning"

Python 430 19 Updated Apr 8, 2024

The official repository of "Video assistant towards large language model makes everything easy"

Python 217 14 Updated Dec 24, 2024

Fast and memory-efficient exact attention

Python 15,158 1,432 Updated Jan 18, 2025

[NeurIPS 2022] Zero-Shot Video Question Answering via Frozen Bidirectional Language Models

Python 155 23 Updated Dec 9, 2024

Official Repository of paper VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding

Python 249 15 Updated Aug 11, 2024

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted fo…

Python 1,275 110 Updated Aug 27, 2024

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 38,653 4,750 Updated Jan 21, 2025

【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Python 3,121 222 Updated Dec 3, 2024

An Extensible Continual Learning Framework Focused on Language Models (LMs)

Python 263 20 Updated Jan 28, 2024

OpenEQA Embodied Question Answering in the Era of Foundation Models

Jupyter Notebook 250 23 Updated Sep 20, 2024
Python 173 9 Updated Jul 12, 2024

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Python 1,482 154 Updated Oct 28, 2024

PyTorch code and models for V-JEPA self-supervised learning from video.

Python 2,745 260 Updated Aug 9, 2024

Official Repository for our ICCV2021 paper: Continual Learning on Noisy Data Streams via Self-Purified Replay

Python 29 4 Updated Jan 5, 2022

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Python 11,857 1,045 Updated Jan 20, 2025

[ICLR'23 Oral] Universal Few-shot Learning of Dense Prediction Tasks with Visual Token Matching

Python 253 13 Updated Oct 13, 2023

An Awesome Collection of Urban Foundation Models (UFMs).

149 14 Updated Jan 17, 2025

Code repository for IMU2CLIP(https//arxiv.org/pdf/2210.14395.pdf)

Python 86 8 Updated Aug 30, 2023

Video datasets

1,294 98 Updated Mar 8, 2023

OpenXAI : Towards a Transparent Evaluation of Model Explanations

JavaScript 237 41 Updated Aug 17, 2024
Python 110 3 Updated Feb 19, 2024

The champion solution for Ego4D Natural Language Queries Challenge in CVPR 2023

Python 16 1 Updated Jan 23, 2024

EILeV: Eliciting In-Context Learning in Vision-Language Models for Videos Through Curated Data Distributional Properties

Python 119 9 Updated Nov 10, 2024
Next
Showing results