Survival analysis is concerned with modelling the time before subjects change state, typically time until death or failure. One key feature of such data is that they can be censored, that is, some subjects will not have changed state before the study ends.
Questions tagged [survival-analysis]
45 questions
11
votes
2 answers
Predict task duration
I'm trying to create a regression model that predicts the duration of a task. The training data I have consists of roughly 40 thousand completed tasks with these variables:
Who performed the task (~250 different people)
What part (subproject) of…
Jurgy
- 228
- 2
- 11
6
votes
3 answers
Data driven approach to define a churn user
I'm trying to define a churn prediction model for an online service (betting/gambling). A lot of papers talk about churn analysis/prediction for telco companies where defining a churn user is straightforward: a churn user is a user who cancels his…
Geims Bond
- 61
- 1
6
votes
1 answer
How to use survival analysis for predictive maintenance for time series data?
So, I have a dataset with daily operating conditions for different machines and a flag saying if it failed or not.
Here is a snapshot of the data.
How can I use survival analysis or any other algorithm to calculate when the machine is expected to…
Chandresh Gupta
- 81
- 1
- 7
4
votes
1 answer
Time dependent Classification problem
I am trying to solve a decision making problem. In it, information evolves and increases with time for each event observed, and the history of the event may be useful.
The problem is as follows: in an electric network at time $t_0$ there is signal…
ARB
- 41
- 4
4
votes
2 answers
How do I predict survival curves using xgboost?
The xgboost package enables survival modeling using parameter arguments: objective = "survival:cox" and eval_metric = "cox-nloglik".
The predict method for the resulting model only outputs risk scores (same as type = "risk" in the survival::coxph…
Iyar Lin
- 749
- 4
- 17
4
votes
1 answer
What are available methods for modeling startup survival rates?
I am interested in modeling startup companies failure and success rates to describe what is the representative startup.
I have 40 companies in a dataset. Each company is represented as a list of all the investment financing rounds it has gone…
blue-dino
- 383
- 2
- 3
- 11
3
votes
1 answer
Can I apply survival analysis to predict if a user will revisit the website?
I have one business problem in hand which is to predict if a user will revisit the website or not within 6 months. I need to majorly understand what are the factors which make the user return and also need to give business recommendations on what…
asspsss
- 75
- 4
3
votes
1 answer
Survival regression with major event that won't happen
I would like to do some survival regression about the duration before the "death" of an individual. The final purpose is to know, given an individual, how long it should take before he'll most likely "die" (probability of the survival function to be…
MBB
- 109
- 6
3
votes
1 answer
What is the outcome of a Cox regression in xgboost?
I am creating a model using xgboost. Regarding its parameters, its objective is survival:cox and its eval_metric is cox-nloglik. The output Y ranges from -800 to 800. However, the predicted values are way to large (in range from 10^3 to 10^13). Why…
Kush Patel
- 151
- 1
- 5
3
votes
1 answer
How to get the survival duration prediction for each individual in the data by using the Kaplan-Meier method?
I am trying to learn how to use the Kaplan-Meier survival estimator model in the lifelines package. The documentation says that the KaplanMeierFitter.fit function returns "a modified self, with new properties like 'survival_function_'." I checked…
Kristada673
- 308
- 3
- 9
3
votes
1 answer
How to treat missing data for survival analysis
I have a dataset consisting of questionnaires from patient survey data. There are around 10 questions which are asked during several stages of treatment like during first day of visit, after a week, after two weeks and so on till after 3 months. Now…
Rajeev Motwani
- 91
- 1
- 1
- 5
3
votes
1 answer
Survival Analysis: Pseudo Observation Vs Stratified Cox Regression. Which one is better?
I've been looking into the Cox Regression method for Survival Analysis in Churn Prediction. Cox regression will allow us to determine the probability that a subscriber will unsubscribe after a time $t$, defined by the hazard rate:
$$
h(t \lvert X_i…
Ajay H
- 222
- 1
- 3
- 9
3
votes
2 answers
Machine Learning or Survival Analysis?
I am working on building prediction model for disk failures (time taken to occur a disk failure and what parameters could strongly affect disk failures). I am bit confused on-
What data preprocessing steps should I perform. The dataset is
highly…
Rohan
- 191
- 10
2
votes
1 answer
Survival analysis to estimate kanban tasks completion times
I am working on a problem to estimate task completion time in kanban (project management tool).
While doing EDA, I looked at tasks that are either done or cancelled. In this case, I defined the completion time as the time taken from task creation to…
Sharath
- 121
- 1
2
votes
2 answers
How to compute score and predict for outcome after N days
Let's say I have a medical dataset/EHR dataset that is retrospective and longitudinal in nature. Meaning one person has multiple measurements across multiple time points (in the past). I did post here but couldn't get any response. So, posting it…
The Great
- 2,525
- 16
- 40