|
ai agent |
4 |
|
artificial general intelligence |
4 |
|
foundation models |
4 |
|
multimodal |
4 |
|
reasoning |
4 |
|
3d controllable generation |
3 |
|
diffusion model |
3 |
|
novel view synthesis |
3 |
|
3d avatar generation |
1 |
|
3d gaussians |
1 |
|
3d human |
1 |
|
benchmark and evaluation |
1 |
|
capability modeling |
1 |
|
categorization |
1 |
|
clustering |
1 |
|
compositional text-to-image generation |
1 |
|
downlink precoding design |
1 |
|
expressive animation |
1 |
|
image captioning |
1 |
|
image generation |
1 |
|
image-text correspondence |
1 |
|
language and vision |
1 |
|
local-global language association |
1 |
|
massive mimo |
1 |
|
mobile network measurement |
1 |
|
performance analysis |
1 |
|
person re-identification |
1 |
|
recognition: detection |
1 |
|
retrieval |
1 |
|
score distillation |
1 |
|
sum-rate |
1 |
|
text-image retrieval |
1 |
|
vision + language |
1 |