How to Construct a Ground-Truth Test Dataset #69

Open
@hungnh1125

Hi,

I noticed that you used SAM and Grounding DINO to generate segmentation masks.

Could you please explain how you merge the outputs from SAM and Grounding DINO to create the ground truth in the GranD-f dataset?
Additionally, could you describe the process of creating the final dense caption?
Is your method fully automated, or does it require manual verification?
I am interested in applying your method to the GranD dataset.

Thank you.
