Text this: Automatic document pseudoclassification and retrieval by word frequency techniques /