Meeting with Sandra 1-28-2020

Sandra wants to work on getting baselines running for all the languages we're examining

  • Still running into difficulty getting Petya access to the system since Brandi left

She thinks that using YASS for Arabic may be useful

  • It may not be the most linguistically sound way to do things but it will be consistent and not be too agressive as a root identification system would (if we reduce the words to only their roots, maybe we loose too much information)
  • Maybe for arabic and other languages it makes sense to use YASS to do splitting rather than stemming (e.g. keep the suffix that gets stripped)