|
|||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
Object org.apache.spark.mllib.feature.IDFModel
public class IDFModel
:: Experimental :: Represents an IDF model that can transform term frequency vectors.
Method Summary | |
---|---|
Vector |
idf()
|
JavaRDD<Vector> |
transform(JavaRDD<Vector> dataset)
Transforms term frequency (TF) vectors to TF-IDF vectors (Java version). |
RDD<Vector> |
transform(RDD<Vector> dataset)
Transforms term frequency (TF) vectors to TF-IDF vectors. |
Vector |
transform(Vector v)
Transforms a term frequency (TF) vector to a TF-IDF vector |
Methods inherited from class Object |
---|
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Method Detail |
---|
public Vector idf()
public RDD<Vector> transform(RDD<Vector> dataset)
If minDocFreq
was set for the IDF calculation,
the terms which occur in fewer than minDocFreq
documents will have an entry of 0.
dataset
- an RDD of term frequency vectors
public Vector transform(Vector v)
v
- a term frequency vector
public JavaRDD<Vector> transform(JavaRDD<Vector> dataset)
dataset
- a JavaRDD of term frequency vectors
|
|||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |