Questions tagged [multivariate-distribution]

41 questions
2
votes
2 answers

Can GLM( generalized linear method) handle the collinearity between the predictor variables in a regression-analysis?

I'm a beginner in Machine learning and I've studied that collinearity among the predictor variables of a model is a huge problem since it can lead to unpredictable model behaviour and a large error. But, are there some models (say GLM) that are…
2
votes
1 answer

Can the dependency between variables be deduced from data? And if so, how?

I have a data set $X$ that consists of $m$ vectors $\vec{x}$ of $n$ real-valued components. Each vector component lies within a corresponding predefined interval of valid values, which is the same for all vectors in $X$. The assumption is that there…
2
votes
2 answers

How to interpret Correlation along with Coefficients of multiple linear regression ?£

I have 10000 samples. There are 4 independent variables and 1 dependent variable. The independent variables are all centered with 0 mean. I found the correlation coefficients between each of these variables which are as below: I used linear…
2
votes
1 answer

How to resample one dataset to conform to the distribution of another dataset?

I have two datasets with 20 features, but with different feature distributions (DS_A and DS_B). How can I sample the DS_A to make its distribution similar to DS_B, with respect to multiple features?? I check the similarity/difference of two datasets…
2
votes
0 answers

How to do Multivariate Adaptive Regression Splines feature selection in python? Specifically, I need the python equivalent of the earth function in R

This is the code in R: marsModel <- earth(eval(parse(text=paste(ResponseVariable,"~."))), data = data) #build model ev <- evimp (marsModel) Response variable is self-explanatory, the data contains 178 features and 1 response variable. Each feature…
2
votes
3 answers

Confidence intervals in multivariate linear regression

I am fitting my data to a multivariate linear regression $Y = BX + \Xi$, where the response is bivariate $Y\in R^{n\times 2}$, and the predictor is uni-variate but elevated to the projective plane to account for the intercept $X\in R^{n\times…
1
vote
1 answer

Meaning of the covariance matrix?

I wonder about the excessive usage of the covariance matrix across all kinds of machine learning tools. So far, for me, the covariance is just a pre-step to get to the correlation. And as there is an obvious reason for the correlation itself, I…
Ben
  • 510
  • 5
  • 13
1
vote
0 answers

MLE for Poisson conditioned on multivariate Gaussian?

I am writing some Python code to fit 2D Gaussians to fluorescent emitters on a dark background to determine the subpixel-resolution (x, y) position of the fluorescent emitter. The crude, pixel-resolution (x, y) locations of the pixels are stored in…
1
vote
1 answer

How to build multiple variable regression having a mix of numerical & categorical features?

There is a need to estimate Annual Average Daily Traffic Volume (AADT). We have bunch of data about vehicles' speeds during several years. It is noticed that AADT depends on the average number of such samples during some time, so a regression model…
1
vote
1 answer

Linear transformation from one sample to another

Generate a Sample $\underline{Z_1}$ $\underline{Z_2}\dots \underline{Z_{5000}}$ , while $\underline{Z_i} \sim N_2[(0,0)^T,I_2]$ generate new sample with size of $ n = 5000$ by applying linear transformation on $\underline{Z_i}$ $\underline{X_1}$…
Mahajna
  • 53
  • 4
1
vote
0 answers

Getting vague results using VAR time series forecasting in python!

Firstly, I am a beginner in this field of Data Science and have tried to implement some time series models for wind speed forecasting. Also, I am aware of the fact that some regression models might give better results, but still, my aim is to crack…
1
vote
0 answers

Sequential sampling from Gaussian conditional not working

I'm trying to sequentially sample from a Gaussian Process prior. The problem is that the samples eventually converge to zero or diverge to infinity. I'm using the basic conditionals described e.g. here Note: the kernel(X,X) function returns the…
1
vote
1 answer

How to draw a sample from data set with respect to a given categorical or numerical variable based on given freely chosen distribution? (Python)

Say I have a data set for some past period. Now new data appears and for a given variable in the data and we find that the distributions have shifted (for example with "age" it would be that suddenly there are not nearly as many older people,…
1
vote
2 answers

How can I fix regression model interpretation of feature?

I'm building a regression model to predict the values of a feature $Y$ given a set of other features $X_{1}, X_{2}, X_{3}..X_{n}$. Onde of these other features, let's say $X_1$, is known to be inversely proportional to $Y$ based on domain's…
1
vote
0 answers

Hidden Markov Model with Autoregressive emission model?

So far, all standard HMM implementations I've seen assume some variation of a Gaussian Mixture (GMM) as their emission model. It can of course only have a single mixture component which reduces it to a standard multivariate normal distribution. In…
1
2 3