Text-Categorization-using-NGrams-in-Apache-Spark

This repository implements N-gram-based text categorization, as described in the paper referenced below, on the Apache Spark big data platform.

Paper reference: Cavnar, William B., and John M. Trenkle. "N-gram-based text categorization." In Proceedings of SDAIR-94, 3rd Annual Symposium on Document Analysis and Information Retrieval, pp. 161-175. 1994.
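
For orientation, the core idea of the cited paper can be sketched in a few lines: build a ranked profile of the most frequent character n-grams for each category, build the same kind of profile for a new document, and pick the category whose profile has the smallest "out-of-place" rank distance. The sketch below is plain Python, not Spark, and it is not the code in this repository. The function names (`ngram_profile`, `out_of_place`, `classify`), the single-underscore padding, and the alphabetical tie-breaking are assumptions made for illustration; the defaults of n = 1..5 and 300 n-grams follow the values commonly quoted from the paper.

```python
import re
from collections import Counter


def ngram_profile(text, max_n=5, top_k=300):
    """Return the top_k most frequent character n-grams (n = 1..max_n),
    ordered from most to least frequent."""
    counts = Counter()
    # Tokens are runs of letters and apostrophes; everything else is a separator.
    for token in re.findall(r"[a-z']+", text.lower()):
        # Simplified padding (assumption): one underscore on each side.
        padded = "_" + token + "_"
        for n in range(1, max_n + 1):
            for i in range(len(padded) - n + 1):
                counts[padded[i:i + n]] += 1
    # Sort by descending count; break ties alphabetically so results are deterministic.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [gram for gram, _ in ranked[:top_k]]


def out_of_place(doc_profile, cat_profile):
    """Sum of rank differences between two profiles.
    An n-gram missing from the category profile costs a fixed maximum penalty."""
    rank = {gram: i for i, gram in enumerate(cat_profile)}
    max_penalty = len(cat_profile)
    return sum(
        abs(i - rank[gram]) if gram in rank else max_penalty
        for i, gram in enumerate(doc_profile)
    )


def classify(text, category_profiles):
    """Return the category whose profile is closest to the text's profile."""
    doc_profile = ngram_profile(text)
    return min(
        category_profiles,
        key=lambda cat: out_of_place(doc_profile, category_profiles[cat]),
    )
```

A possible usage, with made-up training strings standing in for real category corpora:

```python
profiles = {
    "english": ngram_profile("the quick brown fox jumps over the lazy dog"),
    "spanish": ngram_profile("el rapido zorro marron salta sobre el perro perezoso"),
}
label = classify("the lazy dog", profiles)
```

In a Spark setting, the per-category counting step is the natural candidate for distribution, since n-gram counts can be computed per document and then summed per category; how this repository actually partitions the work is not stated here.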