CarlHuangNuc


Code for STaR: Bootstrapping Reasoning With Reasoning (NeurIPS 2022)

Python 201 22 Updated Feb 21, 2023

OS-ATLAS: A Foundation Action Model For Generalist GUI Agents

Python 290 13 Updated Feb 20, 2025

[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Python 1,659 195 Updated Mar 6, 2025

A collection of projects designed to help developers quickly get started with building deployable applications using the Anthropic API

TypeScript 7,994 1,359 Updated Feb 27, 2025

[ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents

Python 182 12 Updated Mar 3, 2025

Qwen2.5-VL is the multimodal large language model series developed by the Qwen team, Alibaba Cloud.

Jupyter Notebook 8,462 596 Updated Mar 7, 2025

Fast Matrix Multiplications for Lookup Table-Quantized LLMs

C++ 231 8 Updated Feb 23, 2025
Python 115 5 Updated Feb 15, 2025

Code for the paper "Achieving Sparse Activation in Small Language Models"

Python 6 Updated Sep 2, 2024

Fast and memory-efficient exact attention

Python 16,160 1,529 Updated Mar 7, 2025

Official PyTorch implementation of "Outlier Weighed Layerwise Sparsity (OWL): A Missing Secret Sauce for Pruning LLMs to High Sparsity"

Python 58 8 Updated Jun 26, 2024

[NeurIPS'24 Spotlight, ICLR'25] Speeds up long-context LLM inference by computing attention approximately with dynamic sparsity, which reduces inference latency by up to 10x for pre-filling on an …

Python 931 46 Updated Feb 25, 2025

Code for the ICCV 2023 Oral paper: Identity-Seeking Self-Supervised Representation Learning for Generalizable Person Re-identification

Python 80 1 Updated Oct 1, 2023

[CVPR2023] Referring Multi-Object Tracking

Python 131 15 Updated Jul 2, 2024

Large-Vocabulary Video Instance Segmentation dataset

Python 81 1 Updated Jul 5, 2024

OVTrack: Open-Vocabulary Multiple Object Tracking [CVPR 2023]

Jupyter Notebook 97 11 Updated Oct 14, 2024

SAM-PT: Extending SAM to zero-shot video segmentation with point-based tracking.

Python 985 62 Updated Jan 27, 2024

Official PyTorch implementation of FB-BEV & FB-OCC - Forward-backward view transformation for vision-centric autonomous driving perception

Python 710 52 Updated Jan 12, 2024

Associating Objects with Transformers for Video Object Segmentation

134 10 Updated Mar 21, 2024

This repository implements continuous test-time adaptation algorithms for object detection on the SHIFT dataset.

Python 26 3 Updated Jul 5, 2024

Official PyTorch Code and Models of "RePaint: Inpainting using Denoising Diffusion Probabilistic Models", CVPR 2022

Python 2,053 165 Updated Aug 20, 2022

(TPAMI 2024) A Survey on Open Vocabulary Learning

898 51 Updated Dec 10, 2024

[CVPR 2023] Unifying Short and Long-Term Tracking with Graph Hierarchies

Python 124 10 Updated Dec 22, 2024

[CVPR 2022] Official code for "RegionCLIP: Region-based Language-Image Pretraining"

Python 749 52 Updated Mar 20, 2024

A DETR-style framework for open-vocabulary detection (OVD). CVPR 2023

Python 183 18 Updated Apr 16, 2023
Python 51 5 Updated Jul 5, 2023

Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]

Python 883 49 Updated Jul 6, 2024

[ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model

Python 1,817 198 Updated Nov 15, 2024

SHIFT Dataset DevKit - CVPR2022

Python 105 10 Updated Jan 8, 2024