java.lang.Object
  opendap.crawler.Equivalence
public class Equivalence
Stores information about a particular 'equivalence class'. Each URLGroup will likely have several of these. Once all the URLs are processed, we will want to look at how individual patterns are distributed to see whether the equivalence class should be split. The likely criterion for splitting a class is that all of the instances fall into a very small number of distinct values with zero outliers. At least that's the idea.
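The splitting criterion described above can be sketched roughly as follows. This is an illustration only, not code from opendap.crawler: the class name `SplitHeuristicSketch`, the method `shouldSplit`, and both thresholds (`maxDistinct`, `minPerValue`) are assumptions, as is the reading of "outlier" as a value that occurs fewer than `minPerValue` times.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical sketch; not part of opendap.crawler.
class SplitHeuristicSketch {

    /**
     * Suggest a split when every instance falls into a small number of
     * distinct values (2..maxDistinct) and no value is an outlier, i.e.
     * seen fewer than minPerValue times. Both thresholds are assumptions.
     */
    static boolean shouldSplit(Map<String, Integer> occurrences,
                               int maxDistinct, int minPerValue) {
        // A single value leaves nothing to split; too many values is not "small".
        if (occurrences.size() < 2 || occurrences.size() > maxDistinct) {
            return false;
        }
        for (int count : occurrences.values()) {
            if (count < minPerValue) {
                return false; // an outlier value blocks the split
            }
        }
        return true;
    }

    public static void main(String[] args) {
        Map<String, Integer> counts = new LinkedHashMap<>();
        counts.put("daily", 120);
        counts.put("monthly", 115);
        System.out.println(shouldSplit(counts, 3, 10)); // prints "true"

        counts.put("tmp", 1); // a single stray value counts as an outlier
        System.out.println(shouldSplit(counts, 3, 10)); // prints "false"
    }
}
```

In practice the per-value counts would presumably come from getOccurrences(comp) for each distinct value, but how the crawler actually makes the decision is not stated here.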
See Also: URLProcessedComponents, Serialized Form
Nested Class Summary | |
---|---|
class | Equivalence.DateClassifications |
class | Equivalence.SortedValues |
class | Equivalence.Values |
Constructor Summary | |
---|---|
Equivalence(int n, URLProcessedComponents.Lexeme l): Make a new Equivalence. |
Method Summary | |
---|---|
void | add(ParsedURL u): Add a new component literal and its source URL to this equivalence class. |
void | addDateClassification(DateClassification.DatePart dp) |
Equivalence.DateClassifications | getDateClassifications() |
int | getNumberDateClassifications() |
int | getNumberOfValues(): How many discrete values exist for this equivalence class? |
int | getOccurrences(java.lang.String comp): How many times does the value 'comp' show up? For a pattern, this can be any number between 1 and getTotalMembers(). |
ParsedURL | getParsedURL(java.lang.String comp) |
java.lang.String | getPattern() |
int | getPatternPosition() |
Equivalence.SortedValues | getSortedValues() |
int | getTotalMembers(): How many URLs contributed to this class? Should be the same as URLGroup's totalMembers. |
Equivalence.Values | getValues() |
boolean | hasDateClassification(DateClassification.DatePart dp) |
boolean | isLitteral() |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Constructor Detail |
---|
public Equivalence(int n, URLProcessedComponents.Lexeme l)
Parameters:
n - Which parsed component of the URL this Equivalence corresponds to
l - What the component value is, and whether it is a pattern
Method Detail |
---|
public void add(ParsedURL u)
Parameters:
u - The parsed URL
public int getPatternPosition()
public java.lang.String getPattern()
public boolean isLitteral()
public int getTotalMembers()
public int getOccurrences(java.lang.String comp)
Parameters:
comp - The component value whose occurrences are counted
public int getNumberOfValues()
public Equivalence.Values getValues()
public Equivalence.SortedValues getSortedValues()
public ParsedURL getParsedURL(java.lang.String comp)
public void addDateClassification(DateClassification.DatePart dp)
public boolean hasDateClassification(DateClassification.DatePart dp)
public int getNumberDateClassifications()
public Equivalence.DateClassifications getDateClassifications()
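To make the relationship between getTotalMembers(), getNumberOfValues(), and getOccurrences(comp) concrete, here is a minimal stand-in that tallies plain strings. It is a sketch under stated assumptions, not the actual implementation: `ValueTallySketch` is a hypothetical class, its `add` takes a String rather than a ParsedURL, and returning 0 for an unseen value is a guess about behavior this page does not document.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in; the real Equivalence.add takes a ParsedURL.
class ValueTallySketch {
    private final Map<String, Integer> counts = new HashMap<>();
    private int totalMembers = 0;

    /** Record one component value, once per contributing URL. */
    void add(String comp) {
        counts.merge(comp, 1, Integer::sum);
        totalMembers++;
    }

    /** Number of URLs that contributed a value. */
    int getTotalMembers() {
        return totalMembers;
    }

    /** Number of distinct values seen. */
    int getNumberOfValues() {
        return counts.size();
    }

    /** Times 'comp' was seen; 0 for an unseen value is an assumption. */
    int getOccurrences(String comp) {
        return counts.getOrDefault(comp, 0);
    }

    public static void main(String[] args) {
        ValueTallySketch tally = new ValueTallySketch();
        tally.add("2009");
        tally.add("2009");
        tally.add("2010");

        System.out.println(tally.getTotalMembers());       // prints "3"
        System.out.println(tally.getNumberOfValues());     // prints "2"
        System.out.println(tally.getOccurrences("2009"));  // prints "2"
    }
}
```

For any value that has been added, the occurrence count lies between 1 and the total number of members, which matches the range stated for getOccurrences.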