Questions tagged [non-parametric]

16 questions
4
votes
2 answers

How do you use KS-test in a data science report?

I'm writing a data science report, I want to find an exist distribution to fit the sample. I got a good looking result , but when I use KS-test to test the model, I got a low p-value,1.2e-4, definitely I should reject the model. I mean, whatever…
2
votes
1 answer

what are the main differences between parametric and non-parametric machine learning algorithms?

I am interested in parametric and non-parametric machine learning algorithms, their advantages and disadvantages and also their main differences regarding computational complexities. In particular I am interested in the parametric Gaussian Mixture…
2
votes
1 answer

Good introductory reference for Bayesian Non-parametric (Dirichlet Process / Chinese Restaurant Process)

I am looking for a recommendation for basic introductory material on Bayesian Non-parametric methods, specifically Dirichlet Process / Chinese Restaurant Process. I am looking for material which covers the modeling part as well as the inference part…
2
votes
2 answers

Whether Chi-square statistic Test helps us to assess a non-linear correlation between two categorical variables?

I have two categorical variables: sports level (1, 2, 3 and 4) and Use of supplements (Yes and No). I analyzed whether they are independent by the X² test, and their association was significant. I would like to know whether chi-squared statistic in…
1
vote
0 answers

How to automatically segment multidimensional data?

How to partition the time-series multidimensional data in the figure below into segments using an unsupervised algorithm, so that the information within the same segment remains consistent, while the information in adjacent segments differs? Note…
Cai
  • 11
  • 2
1
vote
0 answers

Multiple regression with non-normal data in minitab - help

I am aiming to assess the effect of BMI (continuous) on certain biomarkers (also continuous) whilst adjusting for several relevant variables (mixed categorical and continuous) using multiple regression. My data is non-normal which I believe violates…
shar6580
  • 11
  • 1
1
vote
1 answer

Do non-parametric models always overfit without regularization?

Let's scope this to just classification. It's clear that if you fully grow out a decision tree with no regularization (e.g. max depth, pruning), it will overfit the training data and get full accuracy down to Bayes error*. Is this universally true…
1
vote
2 answers

Linear vs Non linear regression (Basic Beginner)

So my doubt is basically in Linear regression, We try to fit a straight line or a curve for a given training set. Now, I believe whenever the features (independent variable) increases, parameters also increases. Hence computing these parameters is…
1
vote
1 answer

About confidence/prediction intervals: parametric methods VS non-parametric (via bootstrap) methods

About the methodology to find confidence and/or prediction intervals in, let's say, a regression problem, I know 2 main options: Checking normality in the estimates/predictions distribution, and applying well known Gaussian alike methods to find…
German C M
  • 2,674
  • 4
  • 18
1
vote
0 answers

Non-parametric regression on set of time series: One model for each or one for all series?

Let's say I have a set of 1D time series which values have been samples in equip-distant time steps with timestamps $1,2,3,...$, they have all the same lengths and are somewhat similar in shape. I want to apply non-parametric regression (e.g. with…
Make42
  • 752
  • 2
  • 7
  • 18
1
vote
2 answers

Two computers, different outputs for same code in R studio

So a classmate and I are working with the same data and code for a problem in a data analysis class. My code for this section looks like this: aml <- c(1.56, 1.28, 1.94, 1.15, 1.31, 1.92, 0.91, 0.96) natl <- c(1.17,0.76, 1.24, 1.01, 1.26, 1.18,…
1
vote
0 answers

Nonparametric Outlier Detection

Which Nonparametric outlier detection do you suggest to detect outliers (red points) in these plots? I have tested std, IQR, etc., but no good result. It is just one vector including normal and outliers. Thanks for your help.
0
votes
2 answers

Books about statistical inference

I'm currently taking a course "Introduction to Machine Learning" which covers the following topics: linear regression, overfitting, classification problems, parametric & non-parametric models, Bayesian & non Bayesian models, generative…
0
votes
1 answer

Logic behind the Statement on Non-Parametric models

I am currently reading 'Mastering Machine Learning with scikit-learn', 2E, by Packt. In Lazy Learning and Non-Parametric models topic in Chapter 3- Classification and Regression with k-Nearest Neighbors, there is a paragraph stating- Non-parametric…
0
votes
1 answer

pass variable length argument to mstats.kruskalwallis

I am trying to run kruskawallis test on multiple columns of my data for that i wrote an function var=['a','b','c','d','e','f','g','h','i','j','k','l','m','n','o','p','q','r','s','t','u','v','w','x','y','z'] def kruskawallis_test(column): …
Ayush Ranjan
  • 401
  • 1
  • 4
  • 13
1
2