Joel Gould recommends 100K or more bytes of text for the vocab bldr. Does
anyone know if one gets any benefit by running on just a few documents,
possibly dplicated many times? In other words, what should one do when just
starting to work on a new vocabulary when only a small amount of relevant
text is available?
Thanks
Steve
![]() |