Resolve "Uniform ngrams creation"
Related to #224
I am trying to understand better how the ngrams algorithms work.
- In analyzing
G.D.A.Flow
->reIndex
, I simplified things:- We don't need to select whole contexts with hyperdata, it's enough just to select context id with hyperdata
-
ngramsByDoc
was a list, generated from a list of docs. I split that into a pure function being mapped overdocs
. - I added unit tests in
T.Ngrams.Count
- I added
TermsWeight
, thisnewtype
is clearer thanInt
that we had before
- I refactored
WithQuery
to have only datafield. This is described in #441