Challenge and Dataset on Large-scale Human-centric Video Analysis in Complex Events (HiEve)
bytetrack using PAKF
5FPS
2
Benchmark performance:
MOTA | w_MOTA | IDF1 | MT | ML | FP | FN | ID_Sw | ID_Sw_DT | Frag | MOTP |
---|---|---|---|---|---|---|---|---|---|---|
47.3943 | 42.9716 | 54.8539 | 19.5173 | 32.9486 | 4477 | 31576 | 600 | 77 | 1532 | 77.9341 |
Detailed performance:
Video ID | MOTA | w_MOTA | IDF1 | MT | ML | FP | FN | ID_Sw | ID_Sw_DT | Frag | MOTP |
---|---|---|---|---|---|---|---|---|---|---|---|
20 | 64.7276 | 64.3921 | 72.2234 | 51.0204 | 18.3673 | 1545 | 2634 | 29 | 1 | 136 | 78.6099 |
21 | 6.3809 | 2.3809 | 24.3865 | 0.0000 | 60.0000 | 114 | 816 | 9 | 1 | 19 | 61.3933 |
22 | 30.9776 | 26.0378 | 31.8647 | 10.6796 | 39.8058 | 286 | 6218 | 196 | 12 | 421 | 72.9508 |
23 | 66.2801 | 60.9194 | 59.0931 | 50.0000 | 14.2857 | 469 | 1013 | 30 | 6 | 88 | 76.9214 |
24 | 14.4963 | 5.5544 | 22.3725 | 2.7356 | 63.2219 | 246 | 11142 | 96 | 30 | 178 | 74.3700 |
25 | 39.1140 | 33.8482 | 63.7871 | 0.5747 | 1.7241 | 900 | 2788 | 9 | 8 | 111 | 77.1240 |
26 | 55.0269 | 52.8769 | 44.7382 | 41.1765 | 29.4118 | 52 | 1591 | 30 | 2 | 75 | 81.6468 |
27 | 76.6778 | 71.1660 | 68.5561 | 67.7419 | 6.4516 | 137 | 984 | 64 | 7 | 165 | 81.9758 |
28 | 56.7706 | 45.7165 | 57.3196 | 20.0000 | 40.0000 | 181 | 882 | 32 | 7 | 78 | 75.9866 |
29 | 64.0143 | 62.2101 | 62.6463 | 64.0000 | 12.0000 | 112 | 656 | 37 | 1 | 89 | 80.1520 |
30 | 58.4972 | 58.4972 | 70.0154 | 28.8889 | 0.0000 | 43 | 764 | 16 | 0 | 51 | 78.4970 |
31 | 83.6207 | 83.6207 | 89.9905 | 86.6667 | 4.4444 | 83 | 275 | 3 | 0 | 23 | 81.3173 |
32 | 58.9603 | 57.4489 | 59.1230 | 30.0000 | 27.5000 | 309 | 1813 | 49 | 2 | 98 | 76.9458 |
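
The count-based columns (FP, FN, ID_Sw, ID_Sw_DT, Frag) in the detailed table should add up to the corresponding entries in the benchmark row. The following is a minimal sketch of that consistency check; the counts are copied from the tables above, and the variable names (`per_video`, `benchmark`, `totals`) are illustrative only, not part of any HiEve tooling. Ratio metrics such as MOTA, IDF1, and MOTP are not simple sums across videos and are deliberately left out.

```python
# Per-video counts copied from the detailed table:
# video id -> (FP, FN, ID_Sw, ID_Sw_DT, Frag)
per_video = {
    20: (1545, 2634, 29, 1, 136),
    21: (114, 816, 9, 1, 19),
    22: (286, 6218, 196, 12, 421),
    23: (469, 1013, 30, 6, 88),
    24: (246, 11142, 96, 30, 178),
    25: (900, 2788, 9, 8, 111),
    26: (52, 1591, 30, 2, 75),
    27: (137, 984, 64, 7, 165),
    28: (181, 882, 32, 7, 78),
    29: (112, 656, 37, 1, 89),
    30: (43, 764, 16, 0, 51),
    31: (83, 275, 3, 0, 23),
    32: (309, 1813, 49, 2, 98),
}

# Counts from the benchmark row, in the same column order.
benchmark = (4477, 31576, 600, 77, 1532)

# Sum each column across all videos.
totals = tuple(sum(column) for column in zip(*per_video.values()))

names = ("FP", "FN", "ID_Sw", "ID_Sw_DT", "Frag")
for name, total, expected in zip(names, totals, benchmark):
    status = "ok" if total == expected else "MISMATCH"
    print(f"{name}: per-video sum = {total}, benchmark = {expected} ({status})")
```

For the numbers listed here, every summed column matches the benchmark row.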