Understanding speech and scene with ears and eyes (USSEE)

Filter
Conference article in proceedings

Search results

  • 2024

    AV-PEA : Parameter-Efficient Adapter for Audio-Visual Multimodal Learning

    Radman, A. & Laaksonen, J., 2024, Proceedings of the 19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 2: VISAPP. SciTePress, p. 730-737 8 p. (Proceedings of the International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications).

    Research output: Chapter in Book/Report/Conference proceedingConference article in proceedingsScientificpeer-review

    Open Access
  • Diffusion-Based Multimodal Video Captioning

    Kainulainen, J., Guo, Z. & Laaksonen, J., 7 Dec 2024, Computer Vision – ACCV 2024 : 17th Asian Conference on Computer Vision, Hanoi, Vietnam, December 8–12, 2024, Proceedings, Part III. Springer, p. 148-165 (Lecture Notes in Computer Science; vol. 15474).

    Research output: Chapter in Book/Report/Conference proceedingConference article in proceedingsScientificpeer-review

  • Size-Modulated Deformable Attention in Spatio-Temporal Video Grounding Pipelines

    Tiwari, H., Pehlivan Tort, S. & Laaksonen, J., 3 Dec 2024, Pattern Recognition - 27th International Conference, ICPR 2024, Proceedings. Antonacopoulos, A., Chaudhuri, S., Chellappa, R., Liu, C.-L., Bhattacharya, S. & Pal, U. (eds.). Springer, p. 308-324 (Lecture Notes in Computer Science; vol. 15318).

    Research output: Chapter in Book/Report/Conference proceedingConference article in proceedingsScientificpeer-review

  • Text-to-Multimodal Retrieval with Bimodal Input Fusion in Shared Cross-Modal Transformer

    Arora, P., Pehlivan, S. & Laaksonen, J., 2024, 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, LREC-COLING 2024 - Main Conference Proceedings. Calzolari, N., Kan, M.-Y., Hoste, V., Lenci, A., Sakti, S. & Xue, N. (eds.). European language resources distribution agency, p. 15823-15834 12 p. (International conference on computational linguistics)(LREC proceedings).

    Research output: Chapter in Book/Report/Conference proceedingConference article in proceedingsScientificpeer-review

    Open Access
    File
    26 Downloads (Pure)
  • 2023

    Anchor-Free Action Proposal Network with Uncertainty Estimation

    Pehlivan, S. & Laaksonen, J., 2023, Proceedings - 2023 IEEE International Conference on Multimedia and Expo, ICME 2023. IEEE, p. 1853-1858 6 p. (Proceedings - IEEE International Conference on Multimedia and Expo; vol. 2023-July).

    Research output: Chapter in Book/Report/Conference proceedingConference article in proceedingsScientificpeer-review

    Open Access
    File
    1 Citation (Scopus)
    42 Downloads (Pure)
  • Deep Ensemble Learning with Frame Skipping for Face Anti-Spoofing

    Muhammad, U., Hoque, M. Z., Oussalah, M. & Laaksonen, J., 2023, 2023 Twelfth International Conference on Image Processing Theory, Tools and Applications (IPTA). IEEE, 6 p. ( International workshops on image processing theory, tools, and applications).

    Research output: Chapter in Book/Report/Conference proceedingConference article in proceedingsScientificpeer-review

    Open Access
    1 Citation (Scopus)
  • Learning by Hallucinating: Vision-Language Pre-training with Weak Supervision

    Wang, T.-J. J., Laaksonen, J., Langer, T., Arponen, H. & Bishop, T., 6 Feb 2023, Proceedings - 2023 IEEE Winter Conference on Applications of Computer Vision, WACV 2023. IEEE, p. 1073-1083 11 p. (IEEE Winter Conference on Applications of Computer Vision).

    Research output: Chapter in Book/Report/Conference proceedingConference article in proceedingsScientificpeer-review

    Open Access
    File
    3 Citations (Scopus)
    53 Downloads (Pure)
  • PiTL: Cross-modal Retrieval with Weakly-supervised Vision-language Pre-training via Prompting

    Guo, Z., Wang, T. J. J., Pehlivan, S., Radman, A. & Laaksonen, J., 19 Jul 2023, SIGIR 2023 - Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, p. 2261-2265 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference article in proceedingsScientificpeer-review

    Open Access
    File
    1 Citation (Scopus)
    83 Downloads (Pure)
  • 2022

    CLIP4IDC: CLIP for Image Difference Captioning

    Guo, Z., Wang, T.-J. J. & Laaksonen, J., Nov 2022, Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing (AACL-IJCNLP). Association for Computational Linguistics, Vol. 2. p. 33-42

    Research output: Chapter in Book/Report/Conference proceedingConference article in proceedingsScientificpeer-review

    Open Access
    File
    65 Downloads (Pure)
  • Post-Attention Modulator for Dense Video Captioning

    Guo, Z., Wang, T.-J. J. & Laaksonen, J., 2022, Proceedings of the 26th International Conference on Pattern Recognition (ICPR). IEEE, p. 1536-1542 (International Conference on Pattern Recognition).

    Research output: Chapter in Book/Report/Conference proceedingConference article in proceedingsScientificpeer-review

    Open Access
    File
    1 Citation (Scopus)
    61 Downloads (Pure)
  • Tracing Signs of Urbanity in the Finnish Fiction Film of the 1950s: Toward a Multimodal Analysis of Audiovisual Data

    Grósz, T., Kallioniemi, N., Kiiskinen, H., Laine, K., Moisio, A., Römpötti, T., Virkkunen, A., Salmi, H., Kurimo, M. & Laaksonen, J., 2022, Proceedings of the 6th Digital Humanities in the Nordic and Baltic Countries Conference (DHNB 2022), Long Papers. CEUR, Vol. 3232. p. 63-78 16 p. (CEUR Workshop Proceedings; no. 3232).

    Research output: Chapter in Book/Report/Conference proceedingConference article in proceedingsScientificpeer-review

    Open Access
    File
    1 Citation (Scopus)
    110 Downloads (Pure)
  • When to Laugh and How Hard? A Multimodal Approach to Detecting Humor and Its Intensity

    Alnajjar, K., Hämäläinen, M., Tiedemann, J., Laaksonen, J. & Kurimo, M., Oct 2022, Proceedings of the 29th International Conference on Computational Linguistics. International Committee on Computational Linguistics, p. 6875-6886 12 p. (Proceedings of the International Conference on Computational Linguistics).

    Research output: Chapter in Book/Report/Conference proceedingConference article in proceedingsScientificpeer-review

    Open Access
    File
    78 Downloads (Pure)