Questions tagged [counts]
14 questions
15
votes
5 answers
How can I ensure anonymity with queries to small datasets?
I'm building a service that will contain personal data relating to real people.
Initially the dataset will be quite small, and as such it may be possible to identify individuals if the search parameters are narrowed sufficiently.
An example of a…
mal
- 253
- 6
2
votes
1 answer
Confidence Intervals for Multi-Categorical Votes
I have an ngram-based language model that produces a long tag list for a given sentence. For example, the just-previous sentence, broken into bigrams, and run through the model might produce something like:
{I have}=>C1 {have an}=>C2 {an…
jackisquizzical
- 181
- 4
1
vote
0 answers
Modeling count data with time-dependent rate
For processes of discrete events occurring in continuous time with time-independent rate, we can use count models like Poisson or Negative Binomial. For discrete events that can occur once per sample in continuous time, with a time-dependent rate,…
Bridgeburners
- 229
- 1
- 7
1
vote
2 answers
R : Counting the number of observations per category
I'm currently starting out in R and wondering how to count the number of observations per day, per node, per replicate from the below dataset, and store in a different data set.
The original dataset looks like this:
Would like the resulting dataset…
lorena
- 11
- 1
- 1
- 2
1
vote
1 answer
How to group by one column and count frequency from other column for each item in the previous column in python?
I am trying to group my data by the 'ID' column. Then I want to count the frequency of 'sequence' for each 'ID'. Here is a sample of the data frame:
ID Sequence
101 1-2
101 3-1
101 1-2
102 4-6
102 7-8
102 4-6
102 4-6
103 …
Farah
- 21
- 2
1
vote
0 answers
Count number of cards stacked one over the other
I have a stack of ATM cards and I want to count the number of cards available in the stack. How to proceed through it? I'm using Python 3.6.0 and opencv2. I'm attaching PNG file of the images. Kindly provide help in this direction.[1
Suraj Singh
- 11
- 1
1
vote
0 answers
Poisson Model (w/ multiple levels X)
Question
Is Poisson model the best method for predicting counts among multiple levels within nominal variable?
Details
Imagine data of 7000 observations, where output= Obs.Count {numeric,0,1,2..8} and features=location {factor, 13 levels} .
When…
OctoCatKnows
- 133
- 2
1
vote
1 answer
Data normalization of count data for neural networks
I have a sparse matrix of count data that I'm using as input to a neural network.
I know, usually, the input data should be normalized (e.g. via min-max scaling, $z$-score standardization, etc.). But for features that are counts, what is a good…
Yenaled
- 11
- 1
1
vote
0 answers
Is zero-inflated negative binomial regression appropriate for this data? Am I interpreting it correctly?
I am evaluating whether governance predictor variables are associated with the prevalence of groundwater fecal contamination in a developing country context, as measured by TTC (Thermotolerant Coliform) counts per 100mL of water. In my data TTC is…
Billy
- 11
- 2
1
vote
1 answer
How to merge columns, value count them and then plot the results?
How do I get from a dataframe with multiple columns that have similar values and need to be merged:
df1 = pd.DataFrame({'firstcolumn':['ab', 'ca', 'da', 'ta','la'],
'secondcolumn':['ab', 'ca', 'ta', 'da', 'sa'],…
Nicola
- 121
- 7
1
vote
1 answer
How to better represent three sets of categorical data?
Given three set of data with categorical integer x-axis with the same range (0-10):
from itertools import chain
from collections import Counter, defaultdict
from IPython.display import Image
import pandas as pd
import numpy as np
import seaborn as…
alvas
- 2,340
- 6
- 25
- 38
1
vote
1 answer
What is the name of my problem - distribution of counts of elements having certain attribute
I have the following problem:
There is a large set of records. Each record in the set has an attribute. For some values of the attribute, there is only one record, for other values there are many records with the value. I want to construct the…
danatel
- 111
- 1
0
votes
2 answers
Which algorithm can I use to estimate total number of passengers carried from time series of passenger counts
I have time series data coming at 10sec intervals from passenger counter in a bus [10,10,10,10,9,9,9,5,5,5,10,10 ...]. I need to estimate the total number of passengers carried in 1 hour. When the counts decrease, it means someone/somepeople got…
0
votes
1 answer
Learning with duplicate count as sample weights
I have a dataset D:
X = D.drop(columns=['target'])
y = D['target']
D is large, but contains huge number of duplicates - and I want to speedup the learning process. I can't simply drop these duplicates, because this will lead to bias in the…
Ars ML
- 61
- 3