Abstract: A method and apparatus for document categorization are described. In one embodiment, the method comprises automatically selecting one or more discriminant term combinations and using the one or more discriminant term combinations for document categorization.
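The abstract does not say how discriminant term combinations are selected or applied, so the following is only a minimal Python sketch under stated assumptions. It treats a "term combination" as an unordered pair of terms co-occurring in a document, and scores each pair by the difference between its document-frequency rate inside and outside a target category. The function names (`term_pairs`, `select_discriminant_pairs`, `categorize`), the scoring rule, and the toy documents are all hypothetical and are not taken from the described method.

```python
from collections import Counter
from itertools import combinations


def term_pairs(text):
    """Return the set of unordered term pairs co-occurring in one document."""
    terms = sorted(set(text.lower().split()))
    return set(combinations(terms, 2))


def select_discriminant_pairs(docs, labels, target, top_k=3):
    """Rank term pairs by how much more often they occur in the target category.

    Assumption: the score is the in-category document-frequency rate minus
    the out-of-category rate. The source does not specify a criterion.
    """
    pos, neg = Counter(), Counter()
    n_pos = sum(1 for label in labels if label == target)
    n_neg = len(labels) - n_pos
    for text, label in zip(docs, labels):
        (pos if label == target else neg).update(term_pairs(text))
    scores = {
        pair: pos[pair] / max(n_pos, 1) - neg[pair] / max(n_neg, 1)
        for pair in pos
    }
    # Sort by descending score, breaking ties alphabetically for determinism.
    ranked = sorted(scores, key=lambda p: (-scores[p], p))
    return ranked[:top_k]


def categorize(text, selected_pairs):
    """Return True if any selected pair appears in the document."""
    return bool(term_pairs(text) & set(selected_pairs))


# Toy data (hypothetical): "interest" and "rate" each appear outside the
# finance category on their own, but the pair appears only inside it.
docs = [
    "interest rate rises",
    "bank raises interest rate",
    "heart rate monitor",
    "interest in football rises",
]
labels = ["finance", "finance", "health", "sports"]

pairs = select_discriminant_pairs(docs, labels, "finance", top_k=1)
print(pairs)
print(categorize("the interest rate fell", pairs))
print(categorize("resting heart rate", pairs))
```

In this toy data neither "interest" nor "rate" separates the finance documents by itself, while the combination does, which is the kind of situation in which selecting term combinations rather than single terms could help categorization.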