ScreenQA dataset was introduced in the "ScreenQA: Large-Scale Question-Answer Pairs over Mobile App Screenshots" paper. It contains ~86K question-answer pairs collected by human annotators for ~35K…

98 7 Updated Jul 18, 2024

microsoft / UFO

A UI-Focused Agent for Windows OS Interaction.

Python 6,480 829 Updated Jan 22, 2025

OpenBMB / Tell_Me_More

Repo for paper "Tell Me More! Towards Implicit User Intention Understanding of Language Model Driven Agents"

Python 48 4 Updated Feb 20, 2024

zjunlp / EasyInstruct

[ACL 2024] An Easy-to-use Instruction Processing Framework for LLMs.

Python 390 36 Updated Dec 23, 2024

gdh1995 / vimium-c

A keyboard shortcut browser extension for keyboard-based navigation and tab operations with an advanced omnibar

TypeScript 3,574 263 Updated Oct 27, 2024

langchain-ai / langchain

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 99,064 16,112 Updated Jan 28, 2025

deepseek-ai / DeepSeek-MoE

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Python 1,153 88 Updated Jan 16, 2024

thunlp / DebugBench

The repository for paper "DebugBench: "Evaluating Debugging Capability of Large Language Models".

Python 61 6 Updated Jul 13, 2024

OSU-NLP-Group / SeeAct

[ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).

Python 696 89 Updated Jan 15, 2025

allenai / unified-io-2

Python 591 27 Updated Feb 15, 2024

Junjie-Ye / ToolEyes

[COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios

Python 64 8 Updated Dec 3, 2024

OpenBMB / RepoAgent

An LLM-powered repository agent designed to assist developers and teams in generating documentation and understanding repositories quickly.

Python 492 83 Updated Dec 23, 2024

TencentQQGYLab / AppAgent

AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.

Python 5,423 602 Updated Aug 8, 2024

thunlp / UnifiedInstructionTuning

Python 3 Updated Dec 15, 2023

rustdesk / rustdesk

An open-source remote desktop application designed for self-hosting, as an alternative to TeamViewer.

Rust 79,944 11,308 Updated Jan 27, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Yujia Qin thuqinyj16

Achievements

Achievements

Highlights

Block or report thuqinyj16

Stars

bytedance / UI-TARS-desktop

bytedance / UI-TARS

OS-Copilot / OS-Atlas

Open-Source-O1 / Open-O1

openai / swarm

fla-org / flash-linear-attention

meta-llama / llama-models

meta-llama / llama-stack-apps

mem0ai / mem0

google-deepmind / open_x_embodiment

appl-team / appl

databricks / dbrx

xai-org / grok-1

THUNLP-MT / StableToolBench

deepseek-ai / DeepSeek-VL

google-research-datasets / screen_qa