All Stories

  1. MER 2024: Semi-Supervised Learning, Noise Robustness, and Open-Vocabulary Multimodal Emotion Recognition
  2. THE-FD: Task Hierarchical Emotion-aware for Fake Detection
  3. Tracing Training Progress: Dynamic Influence Based Selection for Active Learning
  4. Enhancing Unsupervised Visible-Infrared Person Re-Identification with Bidirectional-Consistency Gradual Matching
  5. IFS-SED: Incremental Few-Shot Sound Event Detection Using Explicit Learning and Calibration
  6. VTQAGen: BART-based Generative Model For Visual Text Question Answering
  7. Multi-task Pre-training Language Model for Semantic Network Completion
  8. Multiple Temporal Fusion based Weakly-supervised Pre-training Techniques for Video Categorization
  9. Seeing Speech: Magnetic Resonance Imaging-Based Vocal Tract Deformation Visualization Using Cross-Modal Transformer
  10. Squeeze-and-Excitation network-Based Radar Object Detection With Weighted Location Fusion
  11. Multimodal Deep Learning for Social Media Popularity Prediction With Attention Mechanism
  12. A Quantitative Comparison of Different Machine Learning Approaches for Human Spermatozoa Quality Prediction Using Multimodal Datasets
  13. Multi-Scale Generalized Attention-Based Regional Maximum Activation of Convolutions for Beauty Product Retrieval
  14. Ultrasound-Based Silent Speech Interface using Sequential Convolutional Auto-encoder
  15. Articulatory feature extraction from ultrasound images using pretrained convolutional neural networks
  16. Deep Convolutional Neural Network-Based Early Automated Detection of Diabetic Retinopathy Using Fundus Image
  17. Convolutional neural network-based automatic classification of midsagittal tongue gestural targets using B-mode ultrasound images
  18. Is Speckle Tracking Feasible for Ultrasound Tongue Images?
  19. Cyclic-feature based Doppler scale estimation for orthogonal frequency-division multiplexing (OFDM) signals over doubly selective underwater acoustic channels
  20. An Articulatory-Based Singing Voice Synthesis Using Tongue and Lips Imaging
  21. SU-F-J-04: Automated Detection of Diabetic Retinopathy Using Deep Convolutional Neural Networks
  22. SU-F-J-226: Structural Similarity-Based Ultrasound Image Similarity Measurement
  23. SU-G-JeP1-03: Automatic Motion Tracking Reset in Ultrasound Liver Image Sequences
  24. SU-G-IeP3-08: Image Reconstruction for Scanning Imaging System Based On Shape-Modulated Point Spreading Function
  25. Contour-based 3D tongue motion visualization using ultrasound image sequences