feat: modified SWEBench modal harness to use modal experimental snapshots and processing TestOutput #665

Open
clee-codegen wants to merge 23 commits into develop from swebench-sandbox-snapshots

Conversation

@clee-codegen
Contributor

Motivation

Content

Testing

Please check the following before marking your PR as ready for review

  • I have added tests for my changes
  • I have updated the documentation or added new documentation as needed

@clee-codegen clee-codegen force-pushed the swebench-sandbox-snapshots branch from 8444ca6 to 7a46611 Compare February 28, 2025 00:27
@clee-codegen clee-codegen changed the title feat: first pass over using sandbox and snapshots feat: modified SWEBench modal harness to use modal experimental snapshots and processing TestOutput Feb 28, 2025
@clee-codegen clee-codegen force-pushed the swebench-sandbox-snapshots branch from 7a46611 to 7cf4f81 Compare February 28, 2025 00:28
@clee-codegen clee-codegen force-pushed the swebench-sandbox-snapshots branch from 2195fa5 to b5f1828 Compare February 28, 2025 00:30
Contributor

@jemeza-codegen jemeza-codegen left a comment

Looks like a good start!

raise NotImplementedError("Not implemented")


class VolumeSnapshotManager(SnapshotManager):
Contributor

would we still need this?

Contributor Author

Nah, we'd only use the ModalDict implementation. I left this in here in case, for whatever reason, the Modal Dict doesn't work out. I saw in the documentation that it's not a very stable dict, in that it could get lost.
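
For readers following the thread, here is a rough sketch of the trade-off being discussed: one `SnapshotManager` interface with swappable backends, plus a lookup that tolerates a lost entry by recreating the snapshot. Only the names `SnapshotManager` and `VolumeSnapshotManager` come from the diff; the method names, `DictSnapshotManager`, and `get_or_create_snapshot` are hypothetical, and a plain Python dict stands in for the Modal Dict so that no Modal API is assumed.

```python
from abc import ABC, abstractmethod
from typing import Callable, Dict, Optional


class SnapshotManager(ABC):
    """Hypothetical interface: map an example key to a saved snapshot id."""

    @abstractmethod
    def get_snapshot_uid(self, key: str) -> Optional[str]:
        raise NotImplementedError("Not implemented")

    @abstractmethod
    def save_snapshot_uid(self, key: str, snapshot_uid: str) -> None:
        raise NotImplementedError("Not implemented")


class DictSnapshotManager(SnapshotManager):
    """Stand-in backend: a plain dict plays the role of the shared store."""

    def __init__(self, store: Optional[Dict[str, str]] = None) -> None:
        self._store = {} if store is None else store

    def get_snapshot_uid(self, key: str) -> Optional[str]:
        # A missing key is treated the same as a lost entry.
        try:
            return self._store[key]
        except KeyError:
            return None

    def save_snapshot_uid(self, key: str, snapshot_uid: str) -> None:
        self._store[key] = snapshot_uid


def get_or_create_snapshot(
    manager: SnapshotManager, key: str, create: Callable[[], str]
) -> str:
    """Return the stored snapshot id, rebuilding it if the entry is gone."""
    uid = manager.get_snapshot_uid(key)
    if uid is None:
        uid = create()  # expensive path: build the environment and snapshot it
        manager.save_snapshot_uid(key, uid)
    return uid
```

With this shape, losing the store only costs a rebuild on the next lookup, which is one way to read the decision to rely on the dict-backed manager and keep the volume-backed one as a fallback.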

@codecov
codecov bot commented Feb 28, 2025

Codecov Report

All modified and coverable lines are covered by tests ✅

✅ All tests successful. No failed tests found.

clee-codegen and others added 9 commits February 27, 2025 16:40
# Motivation

The **Codegen on OSS** package provides a pipeline that:

- **Collects repository URLs** from different sources (e.g., CSV files
or GitHub searches).
- **Parses repositories** using the codegen tool.
- **Profiles performance** and logs metrics for each parsing run.
- **Logs errors** to help pinpoint parsing failures or performance
bottlenecks.
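
The four stages above could be sketched roughly as follows. This is an illustration under stated assumptions, not the package's actual code: `collect_repo_urls`, `run_pipeline`, and the `parse` callable are hypothetical names, the CSV is assumed to hold one URL in the first column of each row, and `parse` stands in for whatever invokes the codegen tool.

```python
import csv
import io
import logging
import time
from typing import Callable, Dict, Iterable, Iterator, List

logger = logging.getLogger("oss_pipeline_sketch")


def collect_repo_urls(csv_text: str) -> Iterator[str]:
    """Yield the first column of each non-empty CSV row as a repository URL."""
    for row in csv.reader(io.StringIO(csv_text)):
        if row and row[0].strip():
            yield row[0].strip()


def run_pipeline(urls: Iterable[str], parse: Callable[[str], object]) -> List[Dict]:
    """Parse each repository, recording wall-clock time and any error."""
    results = []
    for url in urls:
        start = time.perf_counter()
        error = None
        try:
            parse(url)  # hypothetical hook for the real parser
        except Exception as exc:
            # Log the traceback so a failing repository can be pinpointed later.
            logger.exception("parse failed for %s", url)
            error = repr(exc)
        results.append(
            {"url": url, "seconds": time.perf_counter() - start, "error": error}
        )
    return results
```

Catching the exception per repository keeps one bad parse from stopping the run, so the metrics list always has one entry per URL.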

<!-- Why is this change necessary? -->

# Content

<!-- Please include a summary of the change -->
see
[codegen-on-oss/README.md](https://github.com/codegen-sh/codegen-sdk/blob/acfe3dc07b65670af33b977fa1e7bc8627fd714e/codegen-on-oss/README.md)

# Testing

<!-- How was the change tested? -->
`uv run modal run modal_run.py`
No unit tests yet 😿 

# Please check the following before marking your PR as ready for review

- [ ] I have added tests for my changes
- [x] I have updated the documentation or added new documentation as
needed
@clee-codegen clee-codegen force-pushed the swebench-sandbox-snapshots branch from b78b6b5 to 01236e5 Compare March 5, 2025 03:41
@clee-codegen clee-codegen force-pushed the swebench-sandbox-snapshots branch from 0cb107a to 60fed54 Compare March 11, 2025 01:11
@clee-codegen clee-codegen marked this pull request as ready for review March 12, 2025 01:18
@clee-codegen clee-codegen requested review from a team and codegen-team as code owners March 12, 2025 01:18

2 participants