jk-final

Final Project for CMSC 25040: Introduction to Computer Vision focused on DETR.

Objective:

DETR is an object detection architecture consisting of a joint Convolutional Neural Network (CNN) and Transformer with a feed-forward network as a head. The Transformer component is responsible for reasoning about object relations using the multi-head attention mechanism, given features extracted by the CNN. Compared with other object detectors, DETR effectively replaces hand-designed components, such as region proposals, non-maximum suppression, or anchor generation, that explicitly encode possible object locations in the image

(1) Implement new dataloaders to process the chosen dataset other than COCO.

(2) Adapt the DETR structure to the classes for your dataset.

(3) Modify training/testing/visualization/analysis codes and fine-tune the system for your detection task

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

jk-final

Objective:

Resources

About

Releases

Packages

Contributors 2

License

j93hahn/jk-final

Folders and files

Latest commit

History

Repository files navigation

jk-final

Objective:

Resources

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Packages