Skip to content

Latest commit

 

History

History
9 lines (6 loc) · 352 Bytes

README.md

File metadata and controls

9 lines (6 loc) · 352 Bytes

SentenceOutlier

Sentence Outlier Detection

This is an R presentation that summarizes an experiment to test a couple of methods to detect anomalus sentence in a text:

  • Stahel-Donoho Estimator Distance
  • The PCout method

Each sentence is characterized by a number of metrics such as: number of words, number of syllables, parsing tree depth, etc.