VidPP
Saffron Zainchkovskaya
Created on October 31, 2023
Over 30 million people build interactive content in Genially.
Check out what others have designed:
TALK ABOUT DYS WITH TEACHER
Presentation
CIRQUE DU SOLEIL
Presentation
LAYOUT ORGANIZATION
Presentation
TALK ABOUT DYS TEACHER-TEACHER
Presentation
PRODUCT MANAGEMENT IN MOVIES & TV SHOWS
Presentation
ESSENTIAL OILS PRESENTATION
Presentation
VEGETARIANISM
Presentation
Transcript
Dr Sunčica Hadžidedić
Saffron Zainchkovskaya
Recommender systems: exploring the effects of explainability on user trust
Motivation
Planning
Deliverables
Issues
Team
Approach
PROJECT OVERVIEW
Problem Statement and the focus of the project
Background
INTRODUCTION
VIDEO OVERVIEW
[1]Hallinan, B., & Striphas, T. (2016). Recommended for you: The Netflix Prize and the production of algorithmic culture. New Media & Society, 18(1), 117-137. https://doi.org/10.1177/1461444814538646
BACKGROUND
- Recommender Systems (RSs) are algorithms that provide personalised recommendations to users.
- RSs analyse patterns and infer preferences from historical user data. Such as:
- past purchases
- viewing histories
- ratings
- Popularised partially due to the Netflix $1 million competition. [1]
- to build an algorithm that beats Netflix's baseline algorithm by 10 percent.
- Commoly used in large producs used on a daily basis:
- e-commerce ( Amazon )
- entertainment ( Spotify )
- content consumption ( TikTok )
- RS operate as "black boxes"
- making it challenging to understand how recommendations are generated.
- The lack of explainability in RSs could lower user trust and satisfaction.
- As users may feel uneasy about trusting recommendations
- RSs have far-reaching implications for people's daily lives.
- There is a growing need for greater awareness of its effects.
- RSs can be used in both high impact domains
- High impact domain: has high impact on a users' life
- Alters the trajectory of their life
- E.G. Job searching
USER TRUST
RSs HAVE HIGH IMPACT
LACK OF TRANSPARENCY
ISSUES
MOTIVATION
- Motivation behind this project
- Understand the impact of explainability on the end user on a deeper level, using multiple testing techniques
- To assess the impacts of an increased level of explainability on the levels of a user’s trust with the system.
- See whether there is a difference when the impact level of the domain is different.
- WHY
- By addressing user trust concerns we can enhance user satisfaction and system adoption
- benefitting both users and organisations.
Project probleM
- Central problem addressed by this project = the lack of knowledge surrounding the effects of the incorporation of explainability within RSs
- and the impact this has on a user’s trust level with the system
- Also addresses the problem seen commonly within literature
- where only one domain is being investigated
USERS HESITATE TO RELY ON RSs
VITAL TO BOTH USERS AND ORGANISATIONS
RSs USED OFTEN & INFLUENCE US DAILY
Project Importance
- The importance of addressing this problem lies in the growing role of RSs in our daily lives
- RSs shape user experiences and decision-making processes
- Without transparency and trust, users may hesitate to rely on RS recommendations,
- leading to missed opportunities for businesses and a less satisfying user experience.
- Addressing these issues is crucial
- Users may lose trust in the system and feel dissatisfied with the suggestions provided.
- The project is unique as it systematically evaluating the effects of explainability on user trust within both low and high impact domains
- provides a comprehensive understanding of the relationship between explainability and trust in RSs.
RESEARCH QUESTION
To what extent does the integration of explainability into a recommendation system enhance the level of user trust, spanning both high and low-impact domains?
DEVELOP 4 RS BACK END
CONDUCT OFFLINE TESTS
Multi-faceted approach
CONDUCT USER STUDIES
DEVELOP THE FRONT END
- Multi-faceted approach
- Project's methodology
- Develop 4 RSs
- 2 low impact, 1 explainable, 1 base
- 2 high impact, 1 explainable, 1 not
- conduct a series of rigorous offline tests
- conduct a number of qualitative user studies
- Evaluate the results of both tests to draw conclusions about the RQ at hand.
How It Will Be Addressed
- Create a mobile application front end for the high impact domain, integrating the backend RS* and ETRS* to it.
- Conduct a user study on all developed RSs on at least 20 users.
- Compare RS1 and ETRS to RS1* and ETRS*, using the qualitative (user-study) tests to identify the influence of the domain on the incorporation of explainability and user trust in RSs.
- Create a mobile application front end for low impact domain, integrating the backend RS and ETRS to it.
- Implement both RSs in the high impact domain [RS1*, ETRS*].
- Compare ETRS* to a baseline RS1*, using quantitative measurements using offline tests.
- Compare RS1 and ETRS to RS1* and ETRS*, using the quantitative, offline tests to identify the influence of the domain on the incorporation of explainability and user trust in RSs.
- Select appropriate algorithms and metrics for explainability and user trust.
- Create the back end for the baseline RS (RS1) in the low impact domain.
- Create the back end for an explainable, and trustworthy RS [ETRS] in a low impact domain.
- Compare ETRS to a baseline RS1, using quantative measurements in an offline test.
INTERMEDIATE
ADVANCED
BASIC
DELIVERABLES
PLANNING
PLANNING
SUCCESS METRICS- PRODUCT
Evaluation Criteria:
- Accuracy: How accurate is the RSs? Has the incorporation of explainability lowered the accuracy?
- Explainability: Is the incorporation of explainable recommendations within systems beneficial to users?
- User Trust: Does this incorporation enhance the trust users have with the system?
- Domains: Is the incorporation of explainations more important in a high/low impact domain?
- Project Completion: How effectively was the project completed within the stipulated timeframe and project scope? Does the completion status of the project reflect on the quality and reliability of the RSs?
- The project will be evaluated on a bi-weekly basis to ensure it is consistently on track.
- During this evaluation the project components below will be examined
SUCCESS METRICS- PROJECT
PROJECT EVALUATION
Evaluation Methodologies:The product will be evaluated using three main methodologies.
- User Study: Conduct a series of user studies on 20 or more individuals
- Offline Test: Test the system fully on the metrics mentioned prior.
- Comparison: Process and compare the results of each of the studies across the differing domain to draw conclusions for the project.
- Evaluate the effect of the proposed methodology when comparing the results
OFFLINE TEST
COMPARISON
USER STUDY
PROJECT EVALUATION
- User study performed
- 20+ participants
- Process:
- Let user interact with the RS (Observed)
- Questions are permitted but users arent guided through the process
- Users interact base and ETRS in one of the two domains (randomised order)
- Questionnaire
- Break
- Users then interact with the base and ETRS in the other domain
- Questionnaire
Project Evlauation - USER STUDY
Recommender systems: exploring the effects of explainability on user trust
SAFFRON ZAINCHKOVSKAYA
THANK YOU!
SUCCESS METRICS- PRODUCT
Evaluation Criteria:
- Accuracy: How accurate is the RSs? Has the incorporation of explainability lowered the accuracy?
- Explainability: Is the incorporation of explainable recommendations within systems beneficial to users?
- User Trust: Does this incorporation enhance the trust users have with the system?
- Domains: Is the incorporation of explainations more important in a high/low impact domain?
- Project Completion: How effectively was the project completed within the stipulated timeframe and project scope? Does the completion status of the project reflect on the quality and reliability of the RSs?
- The project will be evaluated on a bi-weekly basis to ensure it is consistently on track.
- During this evaluation the project components below will be examined
SUCCESS METRICS- PROJECT
PROJECT EVALUATION