Publications

Tapu, R., Mocanu, B., & Chiva, I. C. Multimodal Visual Speech Recognition for Under-Resource Languages via Cross-Modal Learning and Large Language Models. Romanian Journal of Information Science and Technology (ROMJIST), Q1 journal, IF 3.9.
Tănase, C.-A., Dumitrescu, A., Trasca, A. B. D., & Mocanu, B. (2025). Lightweight Machine Learning Models for UWB Localization and Gesture Recognition. In Proceedings of the 28th International Symposium on Wireless Personal Multimedia Communications (WPMC 2025).
Dumitrescu, A., Tănase, C.-A., Vaduva, M., & Mocanu, B. (2025). Enhancing eCall Systems with LLM-Powered First Aid Guidance and Follow-up Information. In Proceedings of the 28th International Symposium on Wireless Personal Multimedia Communications (WPMC 2025).
Tapu, R., & Mocanu, B. (2025). Automatic Audio Description: A Training-Free Approach Using Foundation Models. In Proceedings of the 21st International Conference on Computer Analysis of Images and Patterns (CAIP 2025) (Vol. 15622, pp. 173–183).
Tapu, R., & Mocanu, B. (2025). Lip Reading Across Languages: A Cross-Modal Framework Leveraging Foundation Models. In Proceedings of the 2025 IEEE International Conference on Content-Based Multimedia Indexing (CBMI 2025).
Mocanu, B., & Tapu, R. (2025). A Lightweight Audio-Visual Speaker Detection System for Assistive Video Captioning. In Proceedings of the 2025 13th European Workshop on Visual Information Processing (EUVIP 2025).
Mocanu, B., & Tapu, R. (2025). Seeing Through Words: A Zero-Shot Multimodal Audio Description System with Foundation Models. In Proceedings of the 20th International Symposium on Visual Computing (ISVC 2025). Springer.
Grosu, M., Mocanu, B., Tapu, R., & Datcu, O. (2025). Evaluating Speech Emotion Recognition Systems: From Traditional Low-Level Features to Transformer-Based Models. In Proceedings of the International Conference on E-Health and Bioengineering (EHB 2025).
Constantin, O., Tapu, R., Mocanu, B., & Grosu, M. (2025). Food Image Recognition: From CNNs to Transformers and Multimodal Learning. In Proceedings of the International Conference on E-Health and Bioengineering (EHB 2025).
Ionescu, B., Müller, H., Stanciu, D.-C., Andrei, A.-G., Radzhabov, A., Prokopchuk, Y., … Stein, B. (2025). Overview of ImageCLEF 2025: Multimedia Retrieval in Medical, Social Media and Content Recommendation Applications. In Proceedings of the 16th International Conference of the CLEF Association (CLEF 2025) (pp. 290–314). Springer-Verlag. https://doi.org/10.1007/978-3-032-04354-2_17
Andrei, A.-G., Constantin, M. G., Dogariu, M., Radzhabov, A., Ștefan, L.-D., Prokopchuk, Y., … Ionescu, B. (2025). Overview of ImageCLEFMedical 2025 GANs Task: Training Data Analysis and Fingerprint Detection. In CLEF 2025 Working Notes.
Andrei, A., Constantin, M. G., Dogariu, M., Ștefan, L.-D., & Ionescu, B. (2025). AI Multimedia Lab at ImageCLEFMedical GANs 2025: Identifying Real-Image Usage in Generated Medical Images. In CLEF 2025 Working Notes.