feat: python tools requirement by akihikokuroda · Pull Request #1040 · generative-computing/mellea

akihikokuroda · 2026-05-07T21:22:36Z

Requirement PR

Use this template when adding or modifying requirements in mellea/stdlib/requirements/.

Description

Link to Issue: fix: Requirements library for the Python tool #1023

Add requirements for Python code generation.

Implementation Checklist

Base Class

Extends appropriate base class:
- Requirement - standard requirement
- ALoraRequirement - uses specialized Intrinsic/Adapter for generation-based validation

Validation Logic

validation_fn defined (if using Python-based validation)
- re-usable functionality within the validation_fn should be separated out into mellea/stdlib/tools/
validate returns a ValidationResult with
- a thunk and context if using a backend to generate
- a specific reason and score when possible

Integration

Requirement exported in mellea/stdlib/requirements/__init__.py or, if you are adding a library of requirements, from your sub-module

Testing

Tests added to tests/requirements/
New code has 100% coverage
Ensure existing tests and github automation passes (a maintainer will kick off the github automation when the rest of the PR is populated)

Attribution

AI coding assistants used: claude

Signed-off-by: Akihiko Kuroda <akihikokuroda2020@gmail.com>

github-actions · 2026-05-07T21:22:56Z

The PR description has been updated. Please fill out the template for your PR to be reviewed.

AngeloDanducci · 2026-05-08T20:26:34Z

+                result=False,
+                reason="Your code creates plots with pyplot but never calls `plt.savefig()` to save them.\n\n"
+                "Add this before your plotting code or at the end:\n"
+                "  plt.savefig('{output_path}')\n"


I think this should match the approach in _make_output_artifacts_validator in the way it handles path/output_path.

I assume this was intended to be a f" string instead of a " string.

AngeloDanducci · 2026-05-08T20:26:49Z

+                "Fix this by adding to the top of your code:\n"
+                "  import matplotlib\n"
+                "  matplotlib.use('Agg')\n\n"
+                "Then replace `plt.show()` with `plt.savefig('{output_path}'); plt.close()`",


Same as below ie

I think this should match the approach in _make_output_artifacts_validator in the way it handles path/output_path.

I assume this was intended to be a f" string instead of a " string.

AngeloDanducci · 2026-05-08T20:33:55Z

+    return None
+
+
+def _get_unauthorized_imports(


can we reuse this? https://github.com/generative-computing/mellea/blob/main/mellea/stdlib/tools/interpreter.py#L304

Yes, it is. A new helper function is added that is used by both.

AngeloDanducci · 2026-05-08T20:40:06Z

+    return validate
+
+
+def _make_output_limit_validator(


Does this have an associated test?

Bit hazy on the stdout and stderr here - in this case the ctx.last_output() is a ModelOutputThunk I think? Does that have stdout/err?

Here is what the claude found about the stdout and stderr:

Your intuition is correct. Here's the precise behavior:

ctx.last_output() returns ModelOutputThunk | None

ModelOutputThunk doesn't define stdout/stderr in its class

These attributes are dynamically injected at runtime when tools execute and return ExecutionResult

The hasattr() checks (lines 366-369) are defensive and correct — they gracefully handle both cases (tool execution with output vs. pure LLM response)

No, there was no tests. New tests are added now.

markstur · 2026-05-08T20:00:41Z

+        print("=" * 70)
+        print("Testing Granite 4.1's ability to repair plotting failures")
+        print("=" * 70)
+        print(f"Task: Create a plot of sin(x) and save to {output_path}\n")


suggest using a task variable to share the task string used in actual description with here.

markstur · 2026-05-08T20:03:15Z

+
+async def main():
+    """Run the canonical plotting repair example."""
+    import tempfile


move all the imports to the top

markstur · 2026-05-08T20:23:49Z

+                    module_name = node.module.split(".")[0]
+                    if module_name not in allowed_imports:
+                        unauthorized.append(module_name)
+    except (SyntaxError, ValueError):


why not raise these?
This pass means that the function did not do its job.

Most likely this code is unusable elsewhere anyway, but if that external assumption is true it just snuck past the unauthorized import check.

Comments are added explaining this.

markstur · 2026-05-08T20:38:53Z

+    headless_backends = ("Agg", "Svg", "Cairo", "PDF", "PS", "WebAgg", "nbAgg")
+    for backend in headless_backends:
+        if (
+            f"matplotlib.use('{backend}')" in code


commented out code would be a false positive.

You might consider using python tokenize to strip comments.
Probably strip docstring too.

Added simpler way to strip the comments.

markstur · 2026-05-08T21:34:09Z

+        m = mellea.start_session()
+
+        # Create requirements bundle for plotting validation
+        # Allows matplotlib import (no output_path = skip file creation check)


This example is too confusing. It always fail w/o output_path because there is a bunch of code to ensure that the output file exists. Then I add output_path and still consistently fail to get a file.

This might be intentional (fail to write file), but if so it needs to be clearer because right now it looks like a bad example that needs fixing/debugging.

Yes, this is intentional. I added comment explaining a little better.

Signed-off-by: Akihiko Kuroda <akihikokuroda2020@gmail.com>

python tools requirement

dbef526

Signed-off-by: Akihiko Kuroda <akihikokuroda2020@gmail.com>

akihikokuroda requested a review from a team as a code owner May 7, 2026 21:22

akihikokuroda requested review from jakelorocco and markstur May 7, 2026 21:22

github-actions Bot added the enhancement New feature or request label May 7, 2026

AngeloDanducci requested changes May 8, 2026

View reviewed changes

AngeloDanducci reviewed May 8, 2026

View reviewed changes

markstur reviewed May 8, 2026

View reviewed changes

review comments

119699e

Signed-off-by: Akihiko Kuroda <akihikokuroda2020@gmail.com>

akihikokuroda requested a review from AngeloDanducci May 9, 2026 12:47

review comments

9055101

Signed-off-by: Akihiko Kuroda <akihikokuroda2020@gmail.com>

akihikokuroda requested a review from markstur May 9, 2026 13:28

Conversation

akihikokuroda commented May 7, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Requirement PR

Description

Implementation Checklist

Base Class

Validation Logic

Integration

Testing

Attribution

Uh oh!

github-actions Bot commented May 7, 2026

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

akihikokuroda commented May 7, 2026 •

edited

Loading