Want to create interactive content? It’s easy in Genially!
Get started free
Demo
Saffron Zainchkovskaya
Created on January 8, 2024
Start designing with a free template
Discover more than 1500 professional designs like these:
Transcript
Recommender systems: exploring the effects of interpretable explainability on popular performance metrics
Saffron Zainchkovskaya
Dr Sunčica Hadžidedić
Literature
Issues:
- Single domain exploration [3,5,6,7,8,11]
 - Lack of performance metrics available - mostly user study based
 - (not as conclusive)
 - Lack of the effect of addition of explainability on common metrics like RMSE, precision, and recall.
 - Some papers ignore standard performance metrics
 - Severe lack of human-interpretable explanations
 - Knowledge graphs
 - Cluster graphs
 - Latent factors
 
aims for this project
- Understand Explainability in RSs Across Different Domains
 - Identifying any domain-specific challenges
 - Examine the effects of integrating explainability features on standard performance metrics, across multiple domains.
 - Address the lack of knowledge surrounding the effect of differing explainability incorporation on a multitude of evaluation metrics.
 - Transforming common explanability types to interpretable versions
 - Use metrics to evaluate such explainations.
 
OLd DELIVERABLES
ADVANCED
INTERMEDIATE
BASIC
- Launch a complete mobile application for the RSs in the high impact domain.
 - Execute a comprehensive user studyxplainability and trust in the RS.
 
- Complete data collection and cleaning for the high impact domain
 - Expand the RS & ERS to a front end application
 - Implement the back end for the high impact domain (RS*& ERS*).
 - Perform offline comparisons to assess the impact of domain on explainability and trust.
 
- Complete data collection and cleaning for the low impact domain
 - Develop a baseline recommendation system (RS) and a explinable RS (ERS) in the low impact domain.
 - Conduct offline tests to compare the explainable and trustworthy RS against the baseline
 
NEW DELIVERABLES
ADVANCED
INTERMEDIATE
BASIC
- Use GPT-3 API to process explanations produced by models to a more interpretable format.
 - Test this novel approach by executing a user focus group on the types of explainability produced.
 
- Complete data collection and cleaning for the high-impact domain
 - Implement the back end for the high-impact domain (RS*& ERS*).
 - Perform offline comparisons to assess the impact of the domain on explainability and performance metrics.
 
- Research the optimal techniques for explainability and evaluation
 - Complete data collection and cleaning for the low-impact domain
 - Develop a baseline recommendation system (RS) and an explainable RS (ERS) in the low-impact domain.
 - Conduct offline tests to compare the explainable and trustworthy RS against the baseline
 
Progress ON DELIVERABLES
ADVANCED
INTERMEDIATE
BASIC
- Use ChatGPT API to process explanations produced by models to a more user-friendly format
 - Test this novel approach by executing a user focus group on the types of explainability produced.
 
- Complete data collection and cleaning for the high-impact domain
 - Implement the back end for the high-impact domain (RS*& ERS*).
 - Perform offline comparisons to assess the impact of the domain on explainability and performance metrics.
 
- Research the optimal techniques for explainability and evaluation
 - Complete data collection and cleaning for the low-impact domain, including text pre-processing using transformers (BERT)
 - Develop a baseline recommendation system (RS) and an explainable RS (ERS) in the low-impact domain.
 - Conduct offline tests to compare the explainable and trustworthy RS against the baseline
 
gpt 3 explanation with latent factors
Milestones achieved pt 1
- Thorough research of explainability metrics
 - Collected extensive data on thids
 - Tested GPT3 for producing explanations succesfully from latent factors
 - Planned future steps
 
- Completed data collection for the low-impact domain
 - Collected IMDB data using web scraping
 - Completed data cleaning for the low-impact domain
 - Processed IMDB data to retrieve descriptions, directors, and actors.
 - Used BERT transformer to generate embeddings for descriptions.
 - Completed data analysis for the low-impact domain
 - Analysed the produced data, ready to be fed into the RS
 
Project Plan Overview
Comparison with Actual Progress
Adjustments Made
- Focused on back end before front end
 - Revised the gantt chart for more appropriate completion
 - Revised the time spent on data collection and analysis
 - Revised front end
 - Proposed new user study format
 - Proposed the use of a novel technique of using ChatGPT API to process the recommendations.
 
Challenges and Solutions
- Challenge: Data Accessibility Issues
 - accessing a robust dataset for the high-impact domain.
 - The initial attempt to scrape data from Glassdoor was met with technical roadblocks
 - IP address getting blocked repeatedly.
 - Solution: Innovative Data Collection Strategies
 - Researched and found external application
 - designed for dynamic web scraping, which helped bypass the IP blocking issue.
 
- Thought of strategy to periodically change our IP addresses
 - allowing to continue data collection without interruptions. (currently working on)
 - As a backup, a base dataset/ synthetic data.
 
Next Steps
- Research Latent Factors method
 - Complete LF for ML
 - Create Explanations for low impact
 - Evaluate on metrics
 - Repeat for high impact
 
THANK YOU!
Recommender systems: exploring the effects of explainability on user trust
SAFFRON ZAINCHKOVSKAYA