An algorithm’s ability to detect duplicates is the real measure of its capabilities & precision.
Subtracting the duplicate % reveals the true story and real accuracy. It’s just maths.
After deeper researches, I discovered a pervasive flaw common accross mapping algorithms & systems.
This flaw produces hidden duplicate ghosts that go undetected for long.
There are two types of duplicates: those that can be easily filtered by name or by the destination shown on the map, and those that behave like ghosts for long due to large discrepancies or outbound GPS coordinates. Duplicates can be handled by data driven systems but they have a very different meaning when used to train an agentAI.Some suppliers do not want to be exposed. Their names have been obfuscated.
Click a supplier to see its actual ID mapping accuracy
Duplicates % - ID mapping accuracy *
6%
duplicates+
triplicates+
accuracy
marriott
0%
0%
100%
hotel-matching
0.01%
0%
99.99%
yalago
0.01%
0%
99.99%
didatravel
0.01%
0%
99.99%
dotw
0.01%
0%
99.99%
sunhotels
0.04%
0.01%
99.96%
travco
0.06%
0.01%
99.95%
miki
0.11%
0.01%
99.89%
apitude
0.13%
0.06%
99.84%
tbo
0.24%
0.02%
99.76%
innstant
0.59%
0.01%
99.41%
expedia
1.04%
0.01%
98.96%
magiemix
1.68%
0.02%
98.32%
getaroom
2.11%
0.08%
97.89%
Varioteck
2.5%
0.02%
97.49%
litePanic
3.71%
0.12%
96.29%
teldar
3.84%
0.1%
96.16%
goglobal
4.07%
0.11%
95.93%
airtours
4.59%
0.13%
95.41%
gmbh
5.21%
0.15%
94.79%
Juppiter
5.49%
0.18%
94.51%
arising
5.61%
0.15%
94.39%
iol
5.89%
0.01%
94.11%
About duplicates, triplicates+ importance for an AgentAI - AI training assigns increasing relevance to them, potentially elevating irrelevant data. - In machine learning, a triplicate+ becomes significant. - Inconsistencies across duplicate hotel descriptions leads to AgentAI "hallucinations". - About half of the duplicates can be treated as ghosts. Remember that an AgentAI isn’t restricted to what the map shows. It operates beyond just the visible coordinates. - Duplicates have a different impact on agentAI, unlike they are tolerated in a standard model.