Seminar. Speaker: Dr. 花文妤 (Machine Learning Scientist, Amazon)

  • Event date: 2020-10-29


Title: Similarity Recommendation based on the Attention Mechanism

Speaker: Dr. 花文妤 (Machine Learning Scientist, Amazon)

(Graduate of the Institute of Statistics, National Chiao Tung University; Ph.D., Pennsylvania State University, USA)
   
Time: Thursday, October 29, 2020 (ROC year 109), 11:00-11:50 a.m.
(Tea reception from 10:40 to 11:00 a.m. in Room 428, NCTU Institute of Statistics)

Venue: Room 427, Assembly Building I (綜合一館), National Chiao Tung University
Abstract
Item-to-item similarity has long been used to build recommender systems in industrial settings, owing to its interpretability and real-time computational efficiency. In this work, we develop a new embedding representation for similarity-based recommendation that enriches both the text embedding and the image embedding. First, we improve the text embedding in two ways: 1) we add the item description and bullet points to the title and key attributes to enlarge the text information; 2) we apply topic modeling to the description and bullet points to extract key topics and keywords, and compare a Word2Vec model with a pre-trained, fine-tuned Bidirectional Encoder Representations from Transformers (BERT) model on the text attributes. Second, we test product image embeddings under two settings: 1) max-pooling on a ResNet50 model with triplet loss to obtain 205-dimension embeddings; 2) PCA on the same ResNet50 model to reduce the dimension. Based on experiments with the different text and image embeddings, we propose a solution that outperforms the baseline result [1], with a 20% increase in precision at a fixed recall (0.05). The contributions of this work are: 1) it uses the most comprehensive ASIN catalog information for the text model; 2) it identifies the best combination of text and image embeddings, which yields smaller distances under k-nearest neighbors (KNN) Euclidean measurement and a significant precision increase on a downstream click and purchase task; 3) the framework is not limited to a specific use case and can be easily adapted to different product categories and marketplaces.
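
As a rough illustration of the pipeline the abstract describes (reduce the image embedding with PCA, combine it with the text embedding, then retrieve similar items by Euclidean k-nearest neighbors), a minimal NumPy sketch might look like the following. The random data, the 64-dimension text embedding, the 32 retained PCA components, the L2 normalization, and all function names are assumptions made for this example; only the 205-dimension image embedding comes from the abstract, and this is not the speaker's implementation.

```python
import numpy as np

def pca_reduce(x, n_components):
    """Project the rows of x onto their top principal components (via SVD)."""
    centered = x - x.mean(axis=0)
    # Rows of vt are principal directions, ordered by explained variance.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return centered @ vt[:n_components].T

def l2_normalize(x):
    """Scale each row to unit length so neither modality dominates the distance."""
    norms = np.linalg.norm(x, axis=1, keepdims=True)
    return x / np.maximum(norms, 1e-12)

def combine(text_emb, image_emb, image_dims):
    """Concatenate the text embedding with a PCA-reduced image embedding."""
    image_small = pca_reduce(image_emb, image_dims)
    return np.hstack([l2_normalize(text_emb), l2_normalize(image_small)])

def nearest_neighbors(emb, query_index, k):
    """Return the indices of the k items closest to the query in Euclidean distance."""
    dists = np.linalg.norm(emb - emb[query_index], axis=1)
    dists[query_index] = np.inf  # exclude the query item itself
    return np.argsort(dists)[:k]

# Hypothetical stand-in data: 100 items with random embeddings.
rng = np.random.default_rng(0)
text_emb = rng.normal(size=(100, 64))    # assumed text embedding size
image_emb = rng.normal(size=(100, 205))  # 205 dimensions, as in the abstract
items = combine(text_emb, image_emb, image_dims=32)
neighbors = nearest_neighbors(items, query_index=0, k=5)
print(neighbors)
```

Normalizing each modality before concatenation is one simple way to keep the text and image parts on a comparable scale; the abstract does not say how the two embeddings are actually weighted or combined.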


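The talk will be streamed live on Google Meet, showing the speaker's slides with audio.
The meeting opens 20 minutes before the talk begins; click the link below and then press "Ask to join".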
 
