by Yiping Liao and Cheng Qian

  1. Description: 

Our rule-based hybrid MT engine is customized for skincare product ingredient analysis. Its goal is to translate the ingredient lists found on product labels. The file types it will handle most often are MS Word documents and Excel spreadsheets.
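
As a rough illustration of the dictionary side of such an engine, the sketch below translates a comma-separated ingredient list term by term. It is not the engine itself: the function name `translate_ingredients`, the `GLOSSARY` variable, its two entries, and the English-to-Simplified-Chinese direction are all assumptions made for this example and are not taken from our project dictionary.

```python
from __future__ import annotations

# Hypothetical glossary: these entries are illustrative only and are
# not copied from the project dictionary.
GLOSSARY = {
    "water": "水",
    "glycerin": "甘油",
}


def translate_ingredients(label: str, glossary: dict[str, str]) -> list[str]:
    """Translate a comma-separated ingredient list term by term."""
    translated = []
    for term in label.split(","):
        term = term.strip()
        if not term:
            continue  # skip empty items such as a trailing comma
        # Keep the source term when the glossary has no entry for it.
        translated.append(glossary.get(term.lower(), term))
    return translated


print(translate_ingredients("Water, Glycerin, Niacinamide", GLOSSARY))
```

Terms missing from the glossary are passed through unchanged, so gaps in the dictionary stay visible in the output instead of being silently dropped.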

  • Sources: http://www.cosdna.com/

CosDNA is a web-based collection of cosmetics data, mainly ingredient information. Its data sources include international experts, institutions, and publications. Users can submit a new item if an ingredient is missing from the list. Available languages are English, Simplified Chinese, Traditional Chinese, and Japanese.

  • Dictionary: https://docs.google.com/spreadsheets/d/1taY_zae1V3AdiB0EM7QRKJokHVBJlQIMCbVdpyqWI94/edit?usp=sharing
  • Time used
    1. Dictionary building: 90 min (19:45 – 21:15)
    2. Dictionary upload and training initiation: 12 min (21:43 – 21:55)
    3. Data processing: 65 min (21:55 – 23:00)
    4. Training run: 20 min (23:00 – 23:20; training succeeded)
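
The durations listed above can be checked, and the total working time computed, with a short script. This is only a sketch: the `STEPS` list and the `minutes_between` helper are names introduced for this example, and it assumes all four steps happened on the same evening.

```python
from datetime import datetime

# (step, start, end) as logged in the time list above.
STEPS = [
    ("Dictionary building", "19:45", "21:15"),
    ("Dictionary upload and training initiation", "21:43", "21:55"),
    ("Data processing", "21:55", "23:00"),
    ("Training run", "23:00", "23:20"),
]


def minutes_between(start: str, end: str) -> int:
    """Return the whole minutes between two same-day HH:MM times."""
    fmt = "%H:%M"
    delta = datetime.strptime(end, fmt) - datetime.strptime(start, fmt)
    return int(delta.total_seconds() // 60)


durations = {name: minutes_between(start, end) for name, start, end in STEPS}
total = sum(durations.values())

for name, minutes in durations.items():
    print(f"{name}: {minutes} min")
print(f"Total: {total} min")
```

The four steps add up to 187 minutes of logged work. The span from 19:45 to 23:20 is 215 minutes; the difference is the unlogged 28 minutes between 21:15 and 21:43.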