Beyond Empirical Windowing: An Attention-Based Approach for Trust Prediction in Autonomous Vehicles

February 5, 2024

🔬 Research Summary by Zhaobo Zheng, a scientist at Honda Research Institute USA, Inc.

[Original paper by Minxue Niu, Zhaobo Zheng, Kumar Akash, and Teruhisa Misu]


Overview: Trust in autonomous driving is critical for user experience and system efficiency. This paper uses a selective windowing attention network to predict user trust in autonomous driving. The novel model can also analyze and visualize the scenarios most relevant to changes in user trust.


Introduction

SWAN: A New Way to Understand Human Trust in Autonomous Vehicles (AVs)

Do you want to know how you feel about autonomous driving? Do you want to improve your human-machine interaction design? Do you want to leverage the power of attention mechanisms to analyze long time-series data?

If you answered yes to any of these questions, then have a look at SWAN: a Selective Windowing Attention Network. SWAN is a novel neural network model that can estimate human trust in AV from multimodal signals, such as speech, facial expressions, and physiological data.

Unlike traditional windowing techniques, which require manual tuning and domain knowledge, SWAN automatically selects the most relevant data intervals for trust prediction. SWAN uses window prompts and masked attention transformation to focus on the critical span of trust changes while ignoring the irrelevant or noisy parts.

SWAN has been tested on a new multimodal driving simulation dataset where it outperformed existing baselines, such as CNN-LSTM and Transformer, by a large margin. SWAN also showed robustness across different windowing ranges, demonstrating its flexibility and adaptability.

SWAN offers a promising solution for human state estimation. With SWAN, you can visualize the underlying nuances of human trust.

Key Insights

Trust is an important factor that affects how humans interact with machines, especially in safety-critical domains like AVs. However, trust is a gradual state that changes over time, and it is difficult to label and analyze long time-series data that capture trust variations.

One common technique to deal with long time-series data is windowing, which divides the data into fixed-size, overlapping segments and applies a model to each segment. However, windowing has some drawbacks, such as:

  • The model’s performance depends on the window size, which requires manual tuning and domain knowledge.
  • The window size is fixed, which may not capture the dynamic nature of trust changes.
  • The windowing process may introduce noise or loss of information.
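For concreteness, here is a minimal sketch (in Python with NumPy) of the fixed-size, overlapping windowing that SWAN is contrasted with; the window size, stride, and feature dimensions below are illustrative assumptions, not settings from the paper.

    import numpy as np

    def sliding_windows(signal, window_size, stride):
        # Split a multivariate time series of shape (T, D) into fixed-size,
        # overlapping segments of shape (num_windows, window_size, D).
        T = signal.shape[0]
        starts = range(0, T - window_size + 1, stride)
        return np.stack([signal[s:s + window_size] for s in starts])

    # Example: 60 s of data at 10 Hz with 3 feature channels,
    # cut into 10 s windows that overlap by 5 s.
    x = np.random.randn(600, 3)
    windows = sliding_windows(x, window_size=100, stride=50)
    print(windows.shape)  # (11, 100, 3)

Every downstream model then sees segments of exactly window_size samples, which is why the choice of window size ends up baked into the model's performance.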

To overcome these limitations, the paper introduces a Selective Windowing Attention Network (SWAN), a neural network model that can automatically select the most relevant data segments for trust prediction using attention mechanisms.

SWAN consists of three main components:

  • A window prompt generator creates a set of window prompts representing different input data segments with varying lengths and positions.
  • A masked attention transformer computes the attention scores between the window prompts and the input data and selects the most informative segments based on the scores.
  • A trust predictor aggregates the selected segments and outputs a trust score for the whole input data.
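The paper itself does not ship code, but a hypothetical sketch of how these three components could be wired together is shown below (PyTorch): learnable window prompts attend over the embedded input through a boolean attention mask that confines each prompt to one candidate segment, and the prompt outputs are pooled into a single trust score. Class names, window spans, and dimensions are illustrative assumptions, not the authors' implementation.

    import torch
    import torch.nn as nn

    class WindowPromptAttention(nn.Module):
        # Hypothetical sketch of the SWAN idea: each learnable window prompt
        # may only attend to its own span of the input sequence (via a boolean
        # attention mask), and the prompt outputs are pooled into a trust score.
        def __init__(self, feat_dim, embed_dim, windows, num_heads=4):
            super().__init__()
            self.windows = windows                      # list of (start, end) spans
            self.proj = nn.Linear(feat_dim, embed_dim)  # embed raw multimodal features
            self.prompts = nn.Parameter(torch.randn(len(windows), embed_dim))
            self.attn = nn.MultiheadAttention(embed_dim, num_heads, batch_first=True)
            self.head = nn.Linear(embed_dim, 1)         # trust score for the whole input

        def forward(self, x):                           # x: (batch, T, feat_dim)
            B, T, _ = x.shape
            kv = self.proj(x)
            q = self.prompts.unsqueeze(0).expand(B, -1, -1)
            # True = blocked; each prompt may only attend inside its own window.
            mask = torch.ones(len(self.windows), T, dtype=torch.bool, device=x.device)
            for i, (s, e) in enumerate(self.windows):
                mask[i, s:e] = False
            out, scores = self.attn(q, kv, kv, attn_mask=mask)
            return self.head(out.mean(dim=1)).squeeze(-1), scores

    # Illustrative spans and shapes only.
    model = WindowPromptAttention(feat_dim=16, embed_dim=32,
                                  windows=[(0, 100), (50, 150), (100, 200)])
    trust, attn_scores = model(torch.randn(2, 200, 16))
    print(trust.shape, attn_scores.shape)  # torch.Size([2]) torch.Size([2, 3, 200])

In a setup like this, the attention scores indicate which spans of the input each prompt relied on when producing the trust score, which is the kind of signal the paper visualizes in its qualitative analysis.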

The paper evaluates SWAN on a new multimodal driving simulation dataset, where participants interacted with an AV system and reported their trust levels. The dataset contains speech, facial, and physiological signals and contextual information such as driving scenarios and events.

The paper compares SWAN with several baselines, including:

  • A CNN-LSTM model that applies a convolutional neural network (CNN) and a long short-term memory network (LSTM) to the whole input data.
  • A Transformer model that applies a transformer network to the whole input data.
  • A windowing-based model that applies a CNN-LSTM model to each window segment, and uses an empirical method to select the optimal window size.
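For reference, the snippet below sketches a generic CNN-LSTM regressor of the kind these baselines describe: a 1-D convolution extracts local features, an LSTM summarizes the full sequence, and the final hidden state is mapped to a trust score. Layer sizes and the kernel width are illustrative assumptions, not the configuration used in the paper.

    import torch
    import torch.nn as nn

    class CNNLSTMBaseline(nn.Module):
        # Generic CNN-LSTM regressor: Conv1d over time for local features,
        # LSTM for sequence context, linear head on the last hidden state.
        def __init__(self, feat_dim, conv_channels=32, hidden_dim=64):
            super().__init__()
            self.conv = nn.Conv1d(feat_dim, conv_channels, kernel_size=5, padding=2)
            self.lstm = nn.LSTM(conv_channels, hidden_dim, batch_first=True)
            self.head = nn.Linear(hidden_dim, 1)

        def forward(self, x):                               # x: (batch, T, feat_dim)
            h = torch.relu(self.conv(x.transpose(1, 2)))    # (batch, C, T)
            _, (h_n, _) = self.lstm(h.transpose(1, 2))      # last hidden state
            return self.head(h_n[-1]).squeeze(-1)           # (batch,)

    # Dummy input: batch of 4 sequences, 200 time steps, 16 features.
    baseline = CNNLSTMBaseline(feat_dim=16)
    print(baseline(torch.randn(4, 200, 16)).shape)  # torch.Size([4])

The windowing-based baseline applies the same kind of model to each fixed segment, which is where the empirical search for the optimal window size comes in.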

The paper shows that SWAN outperforms the baselines in trust prediction accuracy and demonstrates robustness across different windowing ranges. It also provides qualitative analysis and visualization of the attention scores and the selected segments, which reveal interesting insights into trust dynamics and the factors that influence trust.

The paper demonstrates that SWAN is a novel and effective method for trust estimation in AVs. It can be extended to other applications that involve human state modeling from long time-series data.

Between the lines

This paper may pave the way for a universal model for cognitive state detection from multimodal signals. Such a model could also provide interpretability into what triggered cognitive state changes.

