Oracle data mining - O-cluster


szalas
01-12-2004, 10:21 AM
Hi,
I've implemented the O-cluster algorithm, and while running some tests I've
noticed strange results. I hope that someone here has worked with this
algorithm and is able to answer my questions:
1. O-cluster uses data histograms. Are the bin values computed in the usual
way for histograms?
(number of points in bin range) / (number of all points * bin width)
2. Is this equation for the chi-square test correct?
2*((observed-expected)^2)/expected, where
observed is the value of the histogram valley and expected is the average of
the histogram counts of the valley and the lower peak.
3. I compute the bin width as 3.49 * standard_deviation * (number of points)^(-1/3)
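
To make the three items above concrete, here is a minimal Python sketch of the formulas exactly as written in this post. It is not Oracle's implementation. The function names are made up for illustration, and using the sample standard deviation for "standard_deviation" is an assumption.

```python
import statistics

# Critical value quoted in the post as the minimum statistical significance.
CRITICAL_VALUE = 3.843


def bin_width(values):
    """Item 3: 3.49 * standard_deviation * n^(-1/3).

    Assumption: sample standard deviation; the post does not say which one.
    """
    n = len(values)
    return 3.49 * statistics.stdev(values) * n ** (-1.0 / 3.0)


def histogram_counts(values, width):
    """Raw number of points per equal-width bin, starting at min(values)."""
    lo = min(values)
    n_bins = int((max(values) - lo) / width) + 1
    counts = [0] * n_bins
    for v in values:
        index = min(int((v - lo) / width), n_bins - 1)
        counts[index] += 1
    return counts


def to_density(counts, width):
    """Item 1: count / (number of all points * bin width)."""
    total = sum(counts)
    return [c / (total * width) for c in counts]


def valley_chi_square(valley, lower_peak):
    """Item 2: 2 * (observed - expected)^2 / expected.

    observed is the valley value; expected is the average of the valley
    and the lower peak.
    """
    expected = (valley + lower_peak) / 2.0
    return 2.0 * (valley - expected) ** 2 / expected


# Same valley/peak ratio, once as raw counts and once as small density values.
stat_from_counts = valley_chi_square(10, 40)
stat_from_density = valley_chi_square(0.01, 0.04)
```

With a valley count of 10 and a lower-peak count of 40, the statistic is 18, well above 3.843. The same shape expressed as densities 0.01 and 0.04 gives 0.018, because this statistic scales linearly with its inputs. So it may be worth checking whether the test is being fed the normalized bin values from item 1 rather than raw counts; that alone could keep it below the threshold.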

My problem is that I have never reached the minimum statistical significance
(3.843) for any of my data sets. It is possible that my data is unsuitable
(maybe too small for O-cluster, or too few dimensions), but some other
clustering algorithms were able to find clusters in the same data sets.

Thank you very much for any clues, ideas, or pointers to places where I can
find some additional help.
Szalas

