Questions tagged [data-analysis]

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

279 questions
26
votes
7 answers

Is Python a viable language to do statistical analysis in?

I originally came from R, but Python seems to be the more common language these days. Ideally, I would do all my coding in Python as the syntax is easier and I've had more real life experience using it - and switching back and forth is a pain. Out…
confused
  • 488
  • 4
  • 10
10
votes
4 answers

Math PhD (Nonlinear Programming) switching to Data Science?

I am a math Ph.D. student who is interested in going to the industry as a Data Scientist after graduation. I will briefly give some background on my education before posing my question, so that it is better understood: Maths Coursework: This has…
8
votes
1 answer

How far can one go with excel?

in my business we handle all analytics through Excel. This includes mostly scheduling, production planning and accounting operations. We currently are looking into adding a bit of predictive modelling and Excel does suffice to a point, but doesn't…
Jcart
  • 340
  • 1
  • 6
7
votes
1 answer

Covariance as inner product

Why is covariance considered as inner product if there is no projection of one vector onto another? Right now I perceive this as just a multiplication of $x$ segment of vector($x_i - \bar{x}$) and $y$ segment($y_i - \bar{y}$) of the same vector in…
user641597
  • 133
  • 3
  • 7
6
votes
0 answers

Fitting model to differenced time series

I have a time series on daily stock price of company(2013 data points).I took a first order difference and the following acf and pacf plots of the differenced series were obtained. However, I am unable to come up with a suitable ARMA model. Can…
Jor_El
  • 231
  • 1
  • 2
  • 4
5
votes
2 answers

Plot Two Categorical Variables

How to plot two categorical variables in Python or using any library? I want to plot the Playing Role of a Cricketer (Batsman, Bowler, etc.) VS Bought_By (Franchise Names, e.g., CSK, DC, etc.). The logic here is to plot the cricket role vs…
Pradeep Singh
  • 83
  • 1
  • 1
  • 6
5
votes
1 answer

experimental design in R project

I want to know of any repositories that contain complete experimental design in R covering basic test and analyses? I want to take a top-buttom approach to learn step by step through a real project how that works. Do you know of any places to find…
user18602524
  • 171
  • 4
5
votes
3 answers

How can data science teams inside businesses measure costs and efficiency of their technical work?

How can data science teams measure and improve costs of their technical work, when they often don't know the monetary value of the datasets and insights they are producing? Are they using industry based benchmarks for technical development, and some…
Guest
  • 51
  • 1
5
votes
2 answers

ETL and Data Engineering - is it purely the knowledge of tools or is there theory behind it?

I would like to better understand what a good Data Englineer must know or what he does. Job descriptions mostly list tools that are required, such as Python. If it is possible to separate Data Engineering from Data Science, on what principles is…
MindYB
  • 51
  • 3
5
votes
5 answers

Additive vs Multiplicative model in Time Series Data

The above time series plot is a daily closing stock index of a company. I want to know which model between additive and multiplicative best suits the above data. I know what the two models are, but i haven't been able to figure out the correct…
Jor_El
  • 231
  • 1
  • 2
  • 4
5
votes
4 answers

What is the difference between data analysis and machine learning?

It seems to me that machine learning (especially deep learning) can work with thousands (even millions) of different inputs. After training an ML model on the inputs, the computer has hopefully "learned" something about the connections between these…
brad
  • 51
  • 1
  • 2
5
votes
2 answers

What advantages does Data Visualization have in EDA?

It is not clear to me what advantage the EDA data visualization provides. By advantage I mean what decision I will make according to one or the other visualization. Could someone give me an example where the data visualization makes me decide for…
4
votes
1 answer

Spatially constrained geospatial similarity

What's the current methodology for clustering geospatial data by features? Example: I have some demographic dataset. Let's say this contains average home price and population density. So, an example correlation here would be home price vs population…
4
votes
2 answers

find most dense neighborhood of points in high dimensional space

I'm working on a project where I have many high-dimensional points and I want to find the most dense neighborhood of them. Ideally, out of my ~500 points that are each a 4x300 matrix (300 ms time series of four variables) I want to find the ~30…
4
votes
1 answer

Monthly trend with fb prophet-Interpreting the graph

I have monthly data with month/year in one column and price on another. I would like to get a yearly trend with fb prophet library in python (how to use monthly data with the library is explained at the end of this page ). This is my code: import…
Munira
  • 157
  • 2
  • 9
1
2 3
18 19