- Copenhagen, Denmark
- https://a-part.ai
- @esbenkc
Starred repositories
3cb: Catastrophic Cyber Capabilities Benchmarking of Large Language Models
Pin files for contextual, codebase-level AI assistance.
An open access book on scientific visualization using python and matplotlib
Some preliminary explorations of Mamba's context scaling.
Which objects are visible through the holes in a picture book? This visual task is easy for adults, doable for primary schoolers, but hard for vision transformers.
This repository contains code for the Democracy x AI Hackathon by Apart Research
How to get started in evaluations and demonstrations research for dangerous capabilities
PrimeVul with the assets under version control on github, not on google drive
WMDP is a LLM proxy benchmark for hazardous knowledge in bio, cyber, and chemical security. We also release code for RMU, an unlearning method which reduces LLM performance on WMDP while retaining …
[NeurIPS 2024] SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challen…
🔐 Make sure AI applications are not injecting 1) suspicious API calls, 2) vulnerabilities, and 3) rogue capabilities
Contains open source code for the paper "Perfectly-secure Steganography using Minimum Entropy Coupling"
AI Chat Browser: Fast, Full webapp access to ChatGPT / Claude / Bard / Bing / Llama2! I use this 20 times a day.
apartresearch / task-standard
Forked from METR/task-standard🚨 METR Task Standard fork for the Code Red Hackathon
An easy-to-use Python framework to generate adversarial jailbreak prompts.
Python code for "Fishing for the answer: Mapping the flow of information in LLM agent groups using lessons from fish schools" submitted to Apart Research Multi-Agent Security Hackathon 2024.