- Darmstadt, Germany
-
03:27
- 1h ahead - egorzadorin.com
- in/egor-zadorin
- https://leetcode.com/egorzadorin/
Highlights
- Pro
Stars
Set of tools to assess and improve LLM security.
Code and results accompanying the paper "Refusal in Language Models Is Mediated by a Single Direction".
We jailbreak GPT-3.5 Turbo’s safety guardrails by fine-tuning it on only 10 adversarially designed examples, at a cost of less than $0.20 via OpenAI’s APIs.
Official Repository for ACL 2024 Paper SafeDecoding: Defending against Jailbreak Attacks via Safety-Aware Decoding
A high-throughput and memory-efficient inference and serving engine for LLMs
A collection of benchmarks and datasets for evaluating LLM.
Papers about red teaming LLMs and Multimodal models.
The guide to online assessments and interviews
AIR-Bench 2024 is a safety benchmark that aligns with emerging government regulations and company policies
Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110). This framework is also used to evaluate text-to-image …
Тестовые задания для самостоятельного выполнения от разных it компаний
A Domain-Specific Language, Jailbreak Attack Synthesizer and Dynamic LLM Redteaming Toolkit
Repository for "StrongREJECT for Empty Jailbreaks" paper
Pure C++ implementation of several models for real-time chatting on your computer (CPU)
The official Python library for the OpenAI API
Do-Not-Answer: A Dataset for Evaluating Safeguards in LLMs
Universal and Transferable Attacks on Aligned Language Models
SuperPrompt is an attempt to engineer prompts that might help us understand AI agents.
Fire native system events from Cypress.
Röttger et al. (2023): "XSTest: A Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models"
General-purpose programming language and toolchain for maintaining robust, optimal, and reusable software.
Chaos Monkey is a resiliency tool that helps applications tolerate random instance failures.
Huly — All-in-One Project Management Platform (alternative to Linear, Jira, Slack, Notion, Motion)
PDF++: the most Obsidian-native PDF annotation & viewing tool ever. Comes with optional Vim keybindings.
Fast, easy and reliable testing for anything that runs in a browser.
a script to run docker-compose.yml using podman