We encourage the community to contribute to this repository. If you have suggestions for new papers, resources, or improvements, please open an issue or submit a pull request.
A fundamental challenge for GUI agents is robustly grounding natural language instructions, which requires not only precise spatial alignment (locating elements accurately) but also correct semantic ...
From fine-tuning open source models to building agentic frameworks on top of them, the open source world is ripe with ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results