Summary
Look into https://github.com/AmrDab/clawdcursor
Which will allow to classify screenshots and check if that works well with deepseek.
It should be way faster instead of relying on Opus models or any vision models really.
Decoupling vision from the models is the right architectural decision for computer use.
Acceptance Criteria
Summary
Look into https://github.com/AmrDab/clawdcursor
Which will allow to classify screenshots and check if that works well with deepseek.
It should be way faster instead of relying on Opus models or any vision models really.
Decoupling vision from the models is the right architectural decision for computer use.
Acceptance Criteria