模型仓库modelzoo

Classification

networkfloatqatquantizationdatasetinput shape
mobilenetv1_imagenet74.1273.9273.61ImageNet1x3x224x224
mobilenetv2_imagenet72.6572.5172.11ImageNet1x3x224x224
resnet18_imagenet72.0472.0372.03ImageNet1x3x224x224
resnet50_imagenet77.3776.9976.94ImageNet1x3x224x224
vargnetv2_imagenet73.9473.5673.64ImageNet1x3x224x224
efficientnet_imagenet74.3174.2374.18ImageNet1x3x224x224
horizon_swin_transformer_imagenet80.2480.1580.05ImageNet1x3x224x224
mixvargenet_imagenet71.3371.2371.04ImageNet1x3x224x224
efficientnasnetm_imagenet80.2479.9979.94ImageNet1x3x280x280
efficientnasnets_imagenet76.6376.2376.03ImageNet1x3x300x300
vit_small_imagenet79.5079.40-ImageNet1x3x224x224
henet_tinye_imagenet77.6877.22-ImageNet1x3x224x224
henet_tinym_imagenet78.3877.95-ImageNet1x3x224x224

Detection

FCOS

networkbackbonefloatqatquantizationdatasetinput shape
fcos_efficientnetb0_mscocoefficientnetb036.2635.7935.59MS COCO1x3x512x512
fcos_efficientnetb1_mscocoefficientnetb141.3741.2140.71MS COCO1x3x640x640
fcos_efficientnetb2_mscocoefficientnetb245.3545.1045.00MS COCO1x3x768x768
fcos_efficientnetb3_mscocoefficientnetb348.0347.6547.58MS COCO1x3x896x896

DETR

networkbackbonefloatqatquantizationdatasetinput shape
detr_resnet50_mscocoresnet5035.7031.4231.31MS COCO1x3x800x1333
detr_efficientnetb3_mscocoefficientnetb337.2135.9535.99MS COCO1x3x800x1333

Deform DETR

networkbackbonefloatqatquantizationdatasetinput shape
deform_detr_resnet50_mscocoresnet5044.3444.65-MS COCO1x3x800x1333

FCOS3D

networkbackbonefloatqatquantizationdatasetinput shape
fcos3d_efficientnetb0_nuscenesefficientnetb030.6030.2730.31nuscenes1x3x512x896

Segmentation

UNet

networkbackbonefloatqatquantizationdatasetinput shape
unet_mobilenetv1_cityscapesMobileNetV168.0267.5667.53Cityscapes1x3x1024x2048

Deeplab

networkbackbonefloatqatquantizationdatasetinput shape
deeplabv3plus_efficientnetm0_cityscapesEfficientNet-M076.3076.2276.12Cityscapes1x3x1024x2048
deeplabv3plus_efficientnetm1_cityscapesEfficientNet-M177.9477.6477.65Cityscapes1x3x1024x2048
deeplabv3plus_efficientnetm2_cityscapesEfficientNet-M278.8278.6578.63Cityscapes1x3x1024x2048

FastScnn

networkbackbonefloatqatquantizationdatasetinput shape
fastscnn_efficientnetb0tiny_cityscapesEfficientNet-B0lite69.9769.9069.88Cityscapes1x3x1024x2048

OpticalFlow

PwcNet

networkbackbonefloatqatquantizationdatasetinput shape
pwcnet_pwcnetneck_flyingchairsPwcNet1.41171.41121.4075FlyingChairs1x6x384x512

Lidar

PointPillars

networkbackbonefloatqatquantizationdatasetinput shape
pointpillars_kitti_carSequentialBottleNeck77.3176.8676.76KITTI3D150000x4

CenterPoint

networkbackbonefloatqatquantizationdatasetinput shape
centerpoint_pointpillar_nuscenesSequentialBottleNeck58.3258.1158.14nuscenes1x5x20x40000, 40000x4

LidarMultiTask

networkbackbonefloatqatquantizationdatasetinput shape
centerpoint_mixvargnet_multitask_nuscenesMixVarGENet58.0957.7257.62nuscenes1x5x20x40000, 40000x4
注解

PointPillars 的指标是 Box3d Moderate 这项。

Lane Detection

GaNet

networkbackbonefloatqatquantizationdatasetinput shape
ganet_mixvargenet_culaneMixVarGENet79.4978.7278.72CuLane1x3x320x800

Multiple Object Track

Motr

networkbackbonefloatqatquantizationdatasetinput shape
motr_efficientnetb3_mot17efficientnetb358.0257.6257.76Mot171x3x800x1422, 1x256x2x128, 1x1x1x256, 1x4x2x128

Binocular depth estimation

StereoNet

networkbackbonefloatqatquantizationdatasetinput shape
stereonet_stereonetneck_sceneflowStereoNeck1.12701.16771.1685SceneFlow1x6x540x960
stereonetplus_mixvargenet_sceneflowMixVarGENet1.12701.13291.1351SceneFlow2x3x544x960

Bev

Bev

networkbackbonefloatqatquantizationdatasetinput shape
bev_ipm_efficientnetb0_multitask_nuscenesefficientnetb030.5930.8030.41nuscenes det6x3x512x960, 6x128x128x2
bev_ipm_efficientnetb0_multitask_nuscenesefficientnetb051.4751.4150.98nuscenes seg6x3x512x960, 6x128x128x2
bev_lss_efficientnetb0_multitask_nuscenesefficientnetb030.0930.0530.01nuscenes det6x3x256x704, 10x128x128x2, 10x128x128x2
bev_lss_efficientnetb0_multitask_nuscenesefficientnetb051.7851.4751.46nuscenes seg6x3x256x704, 10x128x128x2, 10x128x128x2
bev_gkt_mixvargenet_multitask_nuscenesMixVarGENet28.1128.1227.90nuscenes det6x3x512x960, 6x64x64x2, 6x64x64x2, 6x64x64x2, 6x64x64x2, 6x64x64x2, 6x64x64x2, 6x64x64x2, 6x64x64x2, 6x64x64x2
bev_gkt_mixvargenet_multitask_nuscenesMixVarGENet48.5348.0248.37nuscenes seg6x3x512x960, 6x64x64x2, 6x64x64x2, 6x64x64x2, 6x64x64x2, 6x64x64x2, 6x64x64x2, 6x64x64x2, 6x64x64x2, 6x64x64x2
bev_ipm_4d_efficientnetb0_multitask_nuscenesefficientnetb037.2137.1937.17nuscenes det6x3x512x960, 6x128x128x2, 1x64x128x128, 1x128x128x2
bev_ipm_4d_efficientnetb0_multitask_nuscenesefficientnetb052.9053.8053.77nuscenes seg6x3x512x960, 6x128x128x2, 1x64x128x128, 1x128x128x2
detr3d_efficientnetb3_nuscenesefficientnetb334.0433.8733.39nuscenes det6x3x512x1408
petr_efficientnetb3_nuscenesefficientnetb337.6037.3237.31nuscenes det6x3x512x1408
bevformer_tiny_resnet50_detection_nuscenesresnet5037.0036.66-nuscenes det6x3x480x800, 1x2500x256, 6x1x2500x4x2, 1x50x50x2
bev_cft_efficientnetb3_nuscenesefficientnetb332.9332.68-nuscenes det6x3x512x1408
bev_sparse_resnet50_nuscenesefficientnetb356.2855.23-nuscenes det6x3x256x704, 6x4x4, 1x600x11, 1x600x256, 1x600, 1

Keypoint Detection

HeatmapKeypointModel

networkbackbonefloatqatquantizationdatasetinput shape
keypoint_efficientnetb0_carfusionefficientnetb094.3394.3094.31carfusion1x3x128x128

Trajectory Prediction

DenseTNT

networkbackbonefloatqatquantizationdatasetinput shape
densetnt_vectornet_argoverse1vectornet1.29741.29891.3038argoverse 130x9x19x32, 30x11x9x64, 30x1x1x96, 30x2x1x2048, 30x1x1x2048

QCNet

networkbackbonefloatqatquantizationdatasetinput shape
qcnet_oe_argoverse2-83.8483.26-argoverse 2输入见下方list
注解

qcnet_oe_argoverse2 的指标是 HitRate 这项。

qcnet_oe_argoverse2 模型输入shape为:

1x30x50, 1x50x30x30, 1x30x1, 1x1x30x1, 1x1x30x1, 1x1x30x1, 1x1x30x1, 1x1x30x80, 1x1x30x80, 1x1x30x80, 1x1x30x10, 1x1x30x10, 1x1x30x10, 1x1x30x10, 1x1x30x30, 1x1x30x30, 1x1x30x30, 1x30x29x128, 1x30x10x128, 1x30x10x128, 1x80, 1x80, 1x1x80x80, 1x1x80x80, 1x1x80x80, 1x1x80x50, 1x1x80x50, 1x1x80x50, 1x80x50, 1x80x50, 1x80x50, 1x80x50, 1x30x30, 1x30x1, 1x1x30x30, 1x1x30x30, 1x1x30x30, 1x1x30x30, 1x1x30x80, 1x1x30x80, 1x1x30x80, 1x1x30x30, 1x1x30x30, 1x1x30x30, 1x80x80.

Occupancy Prediction

FlashOcc

networkbackbonefloatqatquantizationdatasetinput shape
flashocc_henet_lss_occ3d_nusceneshenet_tinym_imagenet0.36460.3622-occ3d_nuscenes6x3x512x960, 10x128x128x2, 10x128x128x2

Online Map Construction

MapTR

networkbackbonefloatqatquantizationdatasetinput shape
maptrv2_resnet50_bevformer_nuscenesresnet500.58590.58430.5763nuscenes6x3x480x800, 6x1x2500x4x2