Oracle Data Mining


szalas
06-05-2004, 11:00 AM
Hi,
I have a question about Oracle's implementation of the O-Cluster data
mining algorithm.
The paper "O-Cluster: Scalable Clustering of Large High Dimensional Data
Sets" says that the algorithm chooses the best cutting plane in the
histogram using a chi-square statistical test:
2*(observed - expected)^2/expected > 3.843
where
observed - histogram count of the valley
expected - average of the histogram counts of the valley and the lower peak
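A minimal sketch of the test as written above, for checking values by hand. The function name `valley_chi_square` and the example counts are hypothetical; they come neither from the paper nor from Oracle's implementation.

```python
def valley_chi_square(valley_count, lower_peak_count):
    """Statistic as quoted in the post: 2 * (observed - expected)^2 / expected."""
    observed = valley_count
    # expected = average of the valley count and the lower peak count
    expected = (valley_count + lower_peak_count) / 2.0
    if expected == 0:
        return 0.0  # empty valley and empty peak: nothing to test
    return 2.0 * (observed - expected) ** 2 / expected


CRITICAL_VALUE = 3.843  # threshold quoted in the post

# Hypothetical raw counts: 10 points in the valley bin, 40 in the lower peak bin
stat = valley_chi_square(valley_count=10, lower_peak_count=40)
print(stat, stat > CRITICAL_VALUE)
```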
I have clustered an example data set.
Using the Data Mining Browser, I found where the cutting planes pass and
used the equation above to calculate the chi-square value, but I never
got a value above 3.843.
The Data Mining Browser shows histogram counts in the range <0,1>, so
how is it possible to reach 3.843 with this equation?
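One possible reading, sketched here as an assumption and not as a confirmed description of the Data Mining Browser: if the displayed values are normalized frequencies and not raw counts, the statistic cannot reach the threshold. When the valley is no higher than the lower peak, the formula simplifies to (peak - valley)^2 / (peak + valley), which never exceeds the peak value, so it stays at or below 1 for values in <0,1>. The statistic also grows linearly with the scale of the counts. The function name `chi_square` and the total of 1000 points are hypothetical.

```python
def chi_square(valley, lower_peak):
    """2 * (observed - expected)^2 / expected, with expected = mean of the two."""
    expected = (valley + lower_peak) / 2.0
    if expected == 0:
        return 0.0
    return 2.0 * (valley - expected) ** 2 / expected


# Most extreme case possible with values in <0,1>: empty valley, peak at 1
normalized = chi_square(0.0, 1.0)

# Same histogram shape expressed as raw counts, assuming 1000 points at the peak
scale = 1000  # hypothetical
raw = chi_square(0.0 * scale, 1.0 * scale)

print(normalized, raw)  # the raw-count value is `scale` times the normalized one
```

If this is what is happening, the test would have to be evaluated on the raw bin counts for the comparison with 3.843 to be meaningful.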
I would be grateful if someone could explain what is going on.
Thanks in advance,
szalas

