vis4d.model.track3d.cc_3dt
CC-3DT model implementation.
This file composes the operations associated with CC-3DT https://arxiv.org/abs/2212.01247 into the full model implementation.
Classes
|
CC-3DT with custom detection results. |
|
CC-3DT with Faster-RCNN detector. |
|
Output of CC-3DT model with Faster R-CNN detector. |
- class FasterRCNNCC3DTOut(detector_3d_out: Tensor, detector_3d_target: Tensor, detector_3d_labels: Tensor, qdtrack_out: FasterRCNNQDTrackOut)[source]
Output of CC-3DT model with Faster R-CNN detector.
-
detector_3d_out:
Tensor Alias for field number 0
-
detector_3d_target:
Tensor Alias for field number 1
-
detector_3d_labels:
Tensor Alias for field number 2
-
qdtrack_out:
FasterRCNNQDTrackOut Alias for field number 3
-
detector_3d_out:
- class FasterRCNNCC3DT(num_classes, basemodel=None, faster_rcnn_head=None, rcnn_box_decoder=None, qdtrack_head=None, track_graph=None, pure_det=False)[source]
CC-3DT with Faster-RCNN detector.
- __init__(num_classes, basemodel=None, faster_rcnn_head=None, rcnn_box_decoder=None, qdtrack_head=None, track_graph=None, pure_det=False)[source]
Creates an instance of the class.
- Parameters:
num_classes (int) – Number of object categories.
basemodel (BaseModel, optional) – Base model network. Defaults to None. If None, will use ResNet50.
faster_rcnn_head (FasterRCNNHead, optional) – Faster RCNN head. Defaults to None. if None, will use default FasterRCNNHead.
rcnn_box_decoder (DeltaXYWHBBoxDecoder, optional) – Decoder for RCNN bounding boxes. Defaults to None.
qdtrack_head (QDTrack, optional) – QDTrack head. Defaults to None. If None, will use default QDTrackHead.
track_graph (CC3DTrackGraph, optional) – Track graph. Defaults to None. If None, will use default CC3DTrackGraph.
pure_det (bool, optional) – Whether to use pure detection. Defaults to False.
- forward(images, images_hw, intrinsics, extrinsics=None, frame_ids=None, boxes2d=None, boxes3d=None, boxes3d_classes=None, boxes3d_track_ids=None, keyframes=None)[source]
Forward.
- Return type:
- class CC3DT(basemodel=None, qdtrack_head=None, track_graph=None, detection_range=None)[source]
CC-3DT with custom detection results.
- __init__(basemodel=None, qdtrack_head=None, track_graph=None, detection_range=None)[source]
Creates an instance of the class.
- Parameters:
basemodel (BaseModel, optional) – Base model network. Defaults to None. If None, will use ResNet50.
qdtrack_head (QDTrack, optional) – QDTrack head. Defaults to None. If None, will use default QDTrackHead.
track_graph (CC3DTrackGraph, optional) – Track graph. Defaults to None. If None, will use default CC3DTrackGraph.
detection_range (Sequence[float], optional) – Detection range for each class. Defaults to None.
- forward(images_list, images_hw, intrinsics_list, extrinsics_list, frame_ids, pred_boxes3d, pred_boxes3d_classes, pred_boxes3d_scores, pred_boxes3d_velocities)[source]
Forward inference stage.
Curretnly only work with single batch per gpu.
- Return type: