Questions tagged [counts]

14 questions
15
votes
5 answers

How can I ensure anonymity with queries to small datasets?

I'm building a service that will contain personal data relating to real people. Initially the dataset will be quite small, and as such it may be possible to identify individuals if the search parameters are narrowed sufficiently. An example of a…
2
votes
1 answer

Confidence Intervals for Multi-Categorical Votes

I have an ngram-based language model that produces a long tag list for a given sentence. For example, the just-previous sentence, broken into bigrams, and run through the model might produce something like: {I have}=>C1 {have an}=>C2 {an…
1
vote
0 answers

Modeling count data with time-dependent rate

For processes of discrete events occurring in continuous time with time-independent rate, we can use count models like Poisson or Negative Binomial. For discrete events that can occur once per sample in continuous time, with a time-dependent rate,…
Bridgeburners
  • 229
  • 1
  • 7
1
vote
2 answers

R : Counting the number of observations per category

I'm currently starting out in R and wondering how to count the number of observations per day, per node, per replicate from the below dataset, and store in a different data set. The original dataset looks like this: Would like the resulting dataset…
lorena
  • 11
  • 1
  • 1
  • 2
1
vote
1 answer

How to group by one column and count frequency from other column for each item in the previous column in python?

I am trying to group my data by the 'ID' column. Then I want to count the frequency of 'sequence' for each 'ID'. Here is a sample of the data frame: ID Sequence 101 1-2 101 3-1 101 1-2 102 4-6 102 7-8 102 4-6 102 4-6 103 …
Farah
  • 21
  • 2
1
vote
0 answers

Count number of cards stacked one over the other

I have a stack of ATM cards and I want to count the number of cards available in the stack. How to proceed through it? I'm using Python 3.6.0 and opencv2. I'm attaching PNG file of the images. Kindly provide help in this direction.[1
1
vote
0 answers

Poisson Model (w/ multiple levels X)

Question Is Poisson model the best method for predicting counts among multiple levels within nominal variable? Details Imagine data of 7000 observations, where output= Obs.Count {numeric,0,1,2..8} and features=location {factor, 13 levels} . When…
1
vote
1 answer

Data normalization of count data for neural networks

I have a sparse matrix of count data that I'm using as input to a neural network. I know, usually, the input data should be normalized (e.g. via min-max scaling, $z$-score standardization, etc.). But for features that are counts, what is a good…
1
vote
0 answers

Is zero-inflated negative binomial regression appropriate for this data? Am I interpreting it correctly?

I am evaluating whether governance predictor variables are associated with the prevalence of groundwater fecal contamination in a developing country context, as measured by TTC (Thermotolerant Coliform) counts per 100mL of water. In my data TTC is…
Billy
  • 11
  • 2
1
vote
1 answer

How to merge columns, value count them and then plot the results?

How do I get from a dataframe with multiple columns that have similar values and need to be merged: df1 = pd.DataFrame({'firstcolumn':['ab', 'ca', 'da', 'ta','la'], 'secondcolumn':['ab', 'ca', 'ta', 'da', 'sa'],…
Nicola
  • 121
  • 7
1
vote
1 answer

How to better represent three sets of categorical data?

Given three set of data with categorical integer x-axis with the same range (0-10): from itertools import chain from collections import Counter, defaultdict from IPython.display import Image import pandas as pd import numpy as np import seaborn as…
alvas
  • 2,340
  • 6
  • 25
  • 38
1
vote
1 answer

What is the name of my problem - distribution of counts of elements having certain attribute

I have the following problem: There is a large set of records. Each record in the set has an attribute. For some values of the attribute, there is only one record, for other values there are many records with the value. I want to construct the…
danatel
  • 111
  • 1
0
votes
2 answers

Which algorithm can I use to estimate total number of passengers carried from time series of passenger counts

I have time series data coming at 10sec intervals from passenger counter in a bus [10,10,10,10,9,9,9,5,5,5,10,10 ...]. I need to estimate the total number of passengers carried in 1 hour. When the counts decrease, it means someone/somepeople got…
0
votes
1 answer

Learning with duplicate count as sample weights

I have a dataset D: X = D.drop(columns=['target']) y = D['target'] D is large, but contains huge number of duplicates - and I want to speedup the learning process. I can't simply drop these duplicates, because this will lead to bias in the…
Ars ML
  • 61
  • 3