Class ExtensibleDynamicEvaluationStatistics
- All Implemented Interfaces:
- DynamicStatistics
ExtensibleDynamicEvaluationStatistics aims to keep an internal estimate of the cardinality of various statement patterns.
It support getting the overall size, any single dimension cardinality (eg. ?a rdf:type ?b) and also two multidimensional patterns (:Peter rdf:type ?b; and ?a rdf:type foaf:Person).
Since evaluation statistics are best-effort, we use HLL as sets to keep the number of statements for each pattern we support. HLL is a very memory efficient set implementation. Furthermore we hash each pattern into a fixed bucket size, 1024 for single dimension and 64 per dimension for multidimensional patterns.
This means that adding ':peter rdf:type foaf:Person' and ':lisa rdf:type foaf:Person' could potentially return getCardinality(:peter, ?b, ?c) = 2 if both :peter and :lisa hash to the same of the 1024 buckets in subjectIndex.
HLL does not support "remove" operations, so there are two sets of every index. One for all added statements and one for all removed statements. If the user adds, removes and re-adds the same statement then the cardinality for that statement will be incorrect. We call this effect "staleness". To prevent staleness from affecting the returned cardinalities this class needs to be monitored by calling the staleness(...) method. This will automatically be done every 60 seconds by the ExtensibleSailStore.
- 
Nested Class SummaryNested classes/interfaces inherited from class org.eclipse.rdf4j.query.algebra.evaluation.impl.EvaluationStatisticsEvaluationStatistics.CardinalityCalculator
- 
Constructor SummaryConstructorsConstructorDescriptionExtensibleDynamicEvaluationStatistics(ExtensibleSailStore extensibleSailStore) 
- 
Method SummaryModifier and TypeMethodDescriptionvoidadd(ExtensibleStatement statement) voidremove(ExtensibleStatement statement) voidremoveByQuery(Resource subj, IRI pred, Value obj, boolean inferred, Resource... contexts) doublestaleness(long expectedSize) voidMethods inherited from class org.eclipse.rdf4j.query.algebra.evaluation.impl.EvaluationStatisticsgetCardinality
- 
Constructor Details- 
ExtensibleDynamicEvaluationStatistics
 
- 
- 
Method Details- 
createCardinalityCalculator- Specified by:
- createCardinalityCalculatorin class- ExtensibleEvaluationStatistics
 
- 
stalenesspublic double staleness(long expectedSize) - Specified by:
- stalenessin interface- DynamicStatistics
- Returns:
- 1 if stale, 0 if not stale, 0.5 if 50% stale. Seen as, given a random statement (that has either been added, or removed), what is the probability that the statistics will return an incorrect result?
 
- 
add- Specified by:
- addin interface- DynamicStatistics
 
- 
remove- Specified by:
- removein interface- DynamicStatistics
 
- 
removeByQuerypublic void removeByQuery(Resource subj, IRI pred, Value obj, boolean inferred, Resource... contexts) - Specified by:
- removeByQueryin interface- DynamicStatistics
 
- 
waitForQueue- Throws:
- InterruptedException
 
 
-