The stats

The Hedometer data has 3656 entries The ML data has 3343 enttries.

The preprocessing done on the hedometer data makes it difficult to compare the two files. Lowercasing, all punctuation removed, spaced injeted.

Revision #1
Created Thu, Mar 19, 2020 3:28 AM by kenneth
Updated Thu, Mar 19, 2020 3:30 AM by kenneth