Skip to content
View john-b-yang's full-sized avatar
🐶
wuphf.com
🐶
wuphf.com

Highlights

  • Pro

Organizations

@saasbook @SoftwareDefinedBuildings @61c-teach @SWE-bench

Block or report john-b-yang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
john-b-yang/README.md

Hey there 👋

I'm John! Currently a 2nd year CS PhD student at Stanford University.

Check out john-b-yang.github.io for more.

Pinned Loading

  1. SWE-agent/SWE-agent SWE-agent/SWE-agent Public

    SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]

    Python 19k 2.1k

  2. SWE-bench/SWE-bench SWE-bench/SWE-bench Public

    SWE-bench: Can Language Models Resolve Real-world Github Issues?

    Python 4.7k 826

  3. SWE-bench/SWE-smith SWE-bench/SWE-smith Public

    [NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents

    Python 620 114

  4. CodeClash-ai/CodeClash CodeClash-ai/CodeClash Public

    Benchmarking Goal-Oriented Software Engineering

    Python 134 15

  5. princeton-nlp/WebShop princeton-nlp/WebShop Public

    [NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents

    Python 516 98

  6. princeton-nlp/intercode princeton-nlp/intercode Public

    [NeurIPS 2023 D&B] Code repository for InterCode benchmark https://arxiv.org/abs/2306.14898

    Python 246 52