[SciPy-user] scipy.stats.gaussian_kde broken?
Zachary Pincus
zpincus at stanford.edu
Tue Mar 28 20:03:54 EST 2006
Never mind. This was due to a bug in numpy.dot() which is fixed in
the newest version.
Zach
On Mar 28, 2006, at 4:33 PM, Zachary Pincus wrote:
> Hi folks,
>
> I can't seem to get scipy.stats.gaussian_kde to work properly. Here
> is an example.
>
> [In:] scipy.__version__
> '0.4.9.1754'
> [In: ] numpy.__version__
> '0.9.7.2262'
> [In:] k = scipy.stats.gaussian_kde([-2, -1, -0.5, 0, 0, 0, 0, 0.5, 1,
> 2])
> [In:] k([-100, -2, 0, 2, 100])
> array([ 0.52684053, 0.58537837, 0.52762998, 0.52684053,
> 0.52684053])
>
> Clearly the above result is wrong. The 'dataset' points cluster
> around zero, and have no support anywhere near 100 or -100. Yet the
> estimated density is essentially flat across that whole range.
>
> Even more strange is the fact that when the size of the set of points
> to estimate the density at is larger than the size of the data, the
> first estimated value is different than the others.
> [In:] k([-2])
> array([ 0.58537837])
> [In:] k([-2] * 7)
> array([ 0.58537837, 0.58537837, 0.58537837, 0.58537837,
> 0.58537837,
> 0.58537837, 0.58537837])
> [In:] k([-2] * 10)
> array([ 0.08691171, 0.58537837, 0.58537837, 0.58537837,
> 0.58537837,
> 0.58537837, 0.58537837, 0.58537837, 0.58537837,
> 0.58537837])
>
> In fact, I suspect that this first estimated value is the correct
> value, and the rest are garbage.
>
> Any thoughts?
>
> Zach Pincus
> Program in Biomedical Informatics and Department of Biochemistry
> Stanford University School of Medicine
>
> _______________________________________________
> SciPy-user mailing list
> SciPy-user at scipy.net
> http://www.scipy.net/mailman/listinfo/scipy-user
More information about the SciPy-User
mailing list