Cwcw32
  • Harbin Institute of Technology
  • Milky Way


LLM Analytics

TypeScript 595 23 Updated Oct 3, 2024

assistant tools for attention visualization in deep learning

Jupyter Notebook 969 76 Updated Jun 9, 2022

Towards Safe LLM with our simple-yet-highly-effective Intention Analysis Prompting

12 2 Updated Mar 25, 2024

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.

1 Updated Sep 15, 2024

Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'

Python 1,167 89 Updated Oct 3, 2024

Tracking the most popular GitHub repos, updated daily (Python version)

Python 531 134 Updated Oct 3, 2024

Code&Data for the paper "Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based Agents" [NeurIPS 2024]

Python 34 2 Updated Sep 27, 2024

TrustAgent: Towards Safe and Trustworthy LLM-based Agents

Python 22 1 Updated Jul 29, 2024

ALFWorld: Aligning Text and Embodied Environments for Interactive Learning

Python 343 51 Updated Sep 25, 2024

Virtual whiteboard for sketching hand-drawn like diagrams

TypeScript 82,179 7,641 Updated Oct 3, 2024

LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs

Python 1,150 98 Updated Sep 27, 2024

Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vectors

Python 194 11 Updated May 11, 2024

Official Code Repository for LM-Steer Paper: "Word Embeddings Are Steers for Language Models" (ACL 2024 Outstanding Paper Award)

Python 43 11 Updated Oct 1, 2024

Code and data for COLING2024 paper "Characteristic AI Agents via Large Language Models".

Python 23 Updated Mar 21, 2024

ICLR 2024 spotlight

Python 211 29 Updated Jun 6, 2024

An index of algorithms for reinforcement learning from human feedback (RLHF)

85 1 Updated Apr 17, 2024

The official repository of the paper: COLD: A Benchmark for Chinese Offensive Language Detection

207 18 Updated Mar 20, 2023

Repository for the paper "RTP-LX: Can LLMs Evaluate Toxicity in Multilingual Scenarios?"

14 2 Updated Aug 5, 2024

CRiskEval is a Chinese dataset meticulously designed for gauging the risk proclivities inherent in LLMs.

3 Updated Jun 5, 2024

Safety-J: Evaluating Safety with Critique

JavaScript 13 1 Updated Jul 28, 2024

The jailbreak-evaluation is an easy-to-use Python package for language model jailbreak evaluation.

Python 19 3 Updated Sep 6, 2024
Jupyter Notebook 3 Updated Jun 27, 2024

Improved Few-Shot Jailbreaking Can Circumvent Aligned Language Models and Their Defenses (NeurIPS 2024)

Python 37 6 Updated Aug 3, 2024
Python 43 2 Updated Jan 24, 2024

DSIR large-scale data selection framework for language model training

Python 223 19 Updated Apr 7, 2024

Data Structures and Information Retrieval in Python

Jupyter Notebook 130 50 Updated Aug 1, 2024

An open-source, free AI chat client!

C++ 55 8 Updated Sep 26, 2024

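A desktop pet built with Unity 3D (U3D), with all of the code included. The project directory structure can be checked against the video on my Bilibili channel. The model cannot be published, so take whichever parts of the code you need; feel free to get in touch with questions. This is an older project, so please excuse the poor coding habits and naming; there is no need to point them out to me.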

C# 21 1 Updated Oct 11, 2023
Python 5 1 Updated Jun 30, 2024

Best practices for retrieval-augmented generation (RAG) with large language models.

Python 33 4 Updated Sep 4, 2024