Would like a FileQueryTool that can describe any file type #1521

shhlife (Maintainer) · 2024-12-12T16:17:55Z

Is your feature request related to a problem? Please describe.
We have an ImageQueryTool that gives an agent the ability to describe an image. As more models become multi-modal, it'd be nice to have a general-purpose FileQueryTool that understands how to handle more artifact types: text, image, audio, and video. Then we could simply use that tool, and it would choose the appropriate loader.
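
To make the "choose the appropriate loader" step concrete, here is a minimal sketch of dispatching on a file's guessed MIME type. It only uses Python's standard `mimetypes` module; the `LOADERS` mapping, the loader names, and `pick_loader` are hypothetical placeholders for illustration, not existing Griptape classes or APIs.

```python
import mimetypes

# Hypothetical mapping from top-level MIME type to a loader name.
# These names are placeholders, not Griptape classes.
LOADERS = {
    "text": "text_loader",
    "image": "image_loader",
    "audio": "audio_loader",
    "video": "video_loader",
}


def pick_loader(path: str) -> str:
    """Return the loader name for a path, based on its guessed MIME type."""
    mime_type, _ = mimetypes.guess_type(path)
    if mime_type is None:
        raise ValueError(f"Cannot determine file type for {path!r}")
    # "image/jpeg" -> "image", "text/plain" -> "text", and so on.
    kind = mime_type.split("/", 1)[0]
    if kind not in LOADERS:
        raise ValueError(f"No loader registered for {mime_type!r}")
    return LOADERS[kind]
```

Guessing from the file extension is cheap but can be wrong for mislabeled files, so a real tool might also want to inspect the file contents.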

collindutter (Maintainer) · 2024-12-12T19:46:23Z

We can accomplish this behavior with a couple of structures, like so:

from griptape.drivers import LocalStructureRunDriver
from griptape.structures import Agent, Pipeline
from griptape.tasks import PromptTask, ToolTask
from griptape.tools import FileManagerTool, StructureRunTool

# Pipeline that loads a file with FileManagerTool, then prompts on the result.
file_query = Pipeline(
    tasks=[
        ToolTask(tool=FileManagerTool(), id="file"),
        # Feed the output of the "file" task into the prompt.
        PromptTask(lambda task: task.parent_outputs["file"]),
    ],
)

# Expose the pipeline to the agent as a tool it can call.
agent = Agent(
    tools=[
        StructureRunTool(
            description="Can be used to describe files",
            structure_run_driver=LocalStructureRunDriver(
                create_structure=lambda: file_query
            ),
        )
    ]
)

agent.run("Describe this file: assets/mountain.jpg")

This actually has me thinking we might not even need an ImageQueryTool. While I admit it is a bit clunkier to set up, I'd rather invest the time in improving structure calling. What do you think?

Relevant: #1432 (comment)

shhlife (Maintainer, Author) · 2024-12-12T22:52:26Z

Interesting... I do like this pattern, but if I hadn't seen it, I wouldn't have thought to use it. There's a discovery challenge here: this is super cool, but new users won't know to think about things this way.

When I'm hunting around in code, I see ImageQueryTool and I go: oh! I can query an image! That's a feature of Griptape!

Whereas this is way more powerful, but until I'm a Lego master builder I won't know I can do this kind of thing. :)

collindutter (Maintainer) · 2024-12-12T22:55:28Z

Totally understood; this pattern didn't come to me immediately either. Maybe this could be a docs recipe? I'd prefer to keep solutions built from composable elements rather than bespoke ones.

shhlife (Maintainer, Author) · 2024-12-12T22:57:17Z

Yeah... I feel like there's something to that. We could put these patterns in recipes, or is there a way to create scaffolding or something as part of Griptape? Do other frameworks do those sorts of things?

Like discoverable templates?
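
As a rough illustration of what "discoverable templates" could mean, here is a minimal sketch of a recipe registry in plain Python. Everything in it (`RECIPES`, `recipe`, `list_recipes`, `build_file_query`) is hypothetical and does not exist in Griptape; the factory body is a stand-in for the structures from the example earlier in this thread.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical registry: recipe name -> (description, factory function).
RECIPES: Dict[str, Tuple[str, Callable]] = {}


def recipe(name: str, description: str) -> Callable:
    """Register a factory function under a discoverable name."""

    def decorator(factory: Callable) -> Callable:
        RECIPES[name] = (description, factory)
        return factory

    return decorator


@recipe("file_query", "Describe a file by loading it and prompting a model")
def build_file_query() -> dict:
    # Placeholder: a real recipe would build and return the structures
    # from the example earlier in this thread.
    return {"steps": ["load file", "prompt model"]}


def list_recipes() -> List[Tuple[str, str]]:
    """Return (name, description) pairs so users can browse what exists."""
    return sorted((name, desc) for name, (desc, _) in RECIPES.items())
```

The point of the design is that calling `list_recipes()` gives new users a single place to see which patterns are available, while each recipe stays built from the same composable pieces.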

shhlife (Maintainer, Author) · 2025-01-08T15:56:18Z

Curious if you've thought any more about these? Like published snippets?
