A GUI Agent application based on UI-TARS (Vision-Language Model) that allows you to control your computer using natural language.

TypeScript 2,092 130 Updated Jan 28, 2025

OS-ATLAS: A Foundation Action Model For Generalist GUI Agents

257 12 Updated Jan 10, 2025
Python 1,149 41 Updated Nov 21, 2024

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 18,282 1,919 Updated Oct 15, 2024

🚀 Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton

Python 1,818 102 Updated Jan 27, 2025

Utilities intended for use with Llama models.

Python 5,670 949 Updated Jan 24, 2025

Agentic components of the Llama Stack APIs

4,096 626 Updated Jan 28, 2025

The Memory layer for your AI apps

Python 24,172 2,248 Updated Jan 23, 2025
Jupyter Notebook 997 67 Updated Nov 27, 2024

🍎APPL: A Prompt Programming Language. Seamlessly integrate LLMs with programs.

Python 232 4 Updated Jan 4, 2025

Code examples and resources for DBRX, a large language model developed by Databricks

Python 2,527 240 Updated May 1, 2024

Grok open release

Python 49,873 8,342 Updated Aug 30, 2024

A new tool learning benchmark aiming at well-balanced stability and reality, based on ToolBench.

Python 126 15 Updated Sep 15, 2024

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Python 2,537 291 Updated Apr 24, 2024

ScreenQA dataset was introduced in the "ScreenQA: Large-Scale Question-Answer Pairs over Mobile App Screenshots" paper. It contains ~86K question-answer pairs collected by human annotators for ~35K…

98 7 Updated Jul 18, 2024

A UI-Focused Agent for Windows OS Interaction.

Python 6,480 829 Updated Jan 22, 2025

Repo for the paper "Tell Me More! Towards Implicit User Intention Understanding of Language Model Driven Agents"

Python 48 4 Updated Feb 20, 2024

[ACL 2024] An Easy-to-use Instruction Processing Framework for LLMs.

Python 390 36 Updated Dec 23, 2024

A keyboard shortcut browser extension for keyboard-based navigation and tab operations with an advanced omnibar

TypeScript 3,574 263 Updated Oct 27, 2024

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 99,064 16,112 Updated Jan 28, 2025

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Python 1,153 88 Updated Jan 16, 2024

The repository for the paper "DebugBench: Evaluating Debugging Capability of Large Language Models".

Python 61 6 Updated Jul 13, 2024

[ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).

Python 696 89 Updated Jan 15, 2025
Python 591 27 Updated Feb 15, 2024

[COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios

Python 64 8 Updated Dec 3, 2024

An LLM-powered repository agent designed to assist developers and teams in generating documentation and understanding repositories quickly.

Python 492 83 Updated Dec 23, 2024

AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.

Python 5,423 602 Updated Aug 8, 2024
Python 3 Updated Dec 15, 2023

An open-source remote desktop application designed for self-hosting, as an alternative to TeamViewer.

Rust 79,944 11,308 Updated Jan 27, 2025