Trung Thanh Nguyen

🔬 PhD Candidate @ Nagoya University | Student Researcher @ RIKEN

prof_pic.jpg

I am a PhD candidate at Nagoya University, specializing in the Department of Intelligent Systems. My research focuses on vision-language models, multimodal recognition, and video captioning, with applications in solving real-world problems.

Currently, I am a student researcher at RIKEN National Science Institute, working on the Guardian Robot Project. My research involves open-world action detection and multi-view multi-modal action recognition by analyzing multimodal sensory data.

Additionally, I am in charge at the Center for Artificial Intelligence, Mathematical and Data Science, collaborating with Japanese corporations to develop practical AI solutions.

đź“© Contact: nguyent (at) cs.is.i.nagoya-u.ac.jp

Google Scholar   LinkedIn

news

Apr 01, 2025 Renewed Qualified Teaching Assistant certification (valid for 1 year) in higher education teaching from the QTA/GSI Training Center, Tokai National Higher Education and Research System, Japan.
Mar 31, 2025 I received a certificate for supporting Toyota Industries Corporation from the MDA Center, Japan.
Mar 24, 2025 I was invited to Academia Sinica (Taiwan) for a summer school on nonstationary time series analysis for biomedical artificial intelligence.
Mar 18, 2025 I presented our paper on “Multi-modal Multi-view Action Recognition” at Shiga University, Japan.
Mar 03, 2025 I presented our CPDM paper at IEEE/CVF WACV2025, United States.
Feb 28, 2025 Our Grand Challenge proposal “IntentVC” has been accepted at ACM MM2025.
Jan 23, 2025 On a business trip to RIKEN GRP until Feb 21, Japan.
Dec 04, 2024 I presented our MultiASL paper at ACM MMAsia2024, New Zealand.
Oct 01, 2024 I am starting my PhD at Nagoya University, Japan.
Sep 27, 2024 I graduated with a Master's degree as the Honorary Valedictorian of the Graduate School of Informatics, Nagoya University, Japan. YouTube Logo

selected publications

  1. IEEE FG
    2025_FG_GA.jpg
    MultiSensor-Home: A Wide-area Multi-modal Multi-view Dataset for Action Recognition and Transformer-based Sensor Fusion
    Trung Thanh Nguyen, Yasutomo Kawanishi, Vijay John, and 2 more authors
    In Proceedings of the 19th IEEE International Conference on Automatic Face and Gesture Recognition, 2025
  2. IEEE Access
    2024_IEEEACCESS_GA.jpg
    Zero-shot Pill-Prescription Matching with Graph Convolutional Network and Contrastive Learning
    Trung Thanh Nguyen, Phi Le Nguyen, Yasutomo Kawanishi, and 2 more authors
    IEEE Access, 2024
  3. ACM MMAsia
    2024_MMAsia_GA.jpg
    Action Selection Learning for Multi-label Multi-view Action Recognition
    Trung Thanh Nguyen, Yasutomo Kawanishi, Takahiro Komamizu, and 1 more author
    In Proceedings of the 6th ACM International Conference on Multimedia in Asia, 2024
  4. IEEE FG
    2024_FG_GA.jpg
    One-Stage Open-Vocabulary Temporal Action Detection Leveraging Temporal Multi-Scale and Action Label Features
    Trung Thanh Nguyen, Yasutomo Kawanishi, Takahiro Komamizu, and 1 more author
    In Proceedings of the 18th IEEE International Conference on Automatic Face and Gesture Recognition, 2024
  5. IEEE TNSM
    2022_TNSM_GA.png
    Fuzzy Q-Learning-Based Opportunistic Communication for MEC-Enhanced Vehicular Crowdsensing
    Trung Thanh Nguyen, Truong Thao Nguyen, Thanh-Hung Nguyen, and 1 more author
    IEEE Transactions on Network and Service Management, 2022