Questions tagged [feature-reduction]

9 questions
2
votes
0 answers

Reduce number of vectors in dataset to achieve the "same average dimensions result"?

Edit for re-opening the question, I'll try to answer questions made by @user2974951: I have a large user preference statistics for trichotomic data sets. You can visualize each data trio as a 3D vector with X, Y and Z values. All vectors complies to…
1
vote
0 answers

Information compression for variable input size

Is there a way to compress information of a variable input size? Autoencoder requires standardized input sizes. Although I can add masks on the cost function and add dummy features to standardize input/output size, I am hesitant with the potential…
Simon
  • 111
  • 1
1
vote
3 answers

Correlation Matrix for non-numeric features

Currently, I have dataset with numeric as well non-numeric attributes. I am trying to remove the redundant features in the dataset using R Programming Languages. Note: Non-numeric attributes cannot be turned into binary. The Caret R package…
ray mai
  • 11
  • 1
  • 2
1
vote
0 answers

Does it exist feature selection/reduction techniques in $O(n \cdot d)$?

I'm curious to know if feature selection and/or feature reduction techniques exist, which are linear on number of data $n$ and on number of dimensions $d$. References and source code are very welcome.
KyBe
  • 410
  • 3
  • 13
0
votes
2 answers

Using PCA as features for production

I struggle with figuring out how to proceed with taking PCA into production in order to test my Models with unknown samples. I'm using both an One-Hot-Encoding an an TF-IDF in order to classify my elements with various models, mainly KNN. I know i…
0
votes
2 answers

Feature reduction by removing certain columns in dataframe

I am working with the Emotion recognition model with the IEMOCAP dataset. For the feature extraction, I am taking mel-spectrogram and then convert it into a NumPy array and converting the array into a data frame of spectrogram features. The…
adikh
  • 101
  • 1
0
votes
0 answers

How should I encode 'dynamic' features (with multiple instances) along with 'static' features (single instances)?

Suppose I have to predict if a certain product from an assembly line in a factory will be a scrap. This product has let's say 'static' data like a certain shape. A certain vendor, etc. And, it can have 'dynamic' data this meaning it can have for…
0
votes
1 answer

Heuristics, methods to speed up searches over subsets of big set (combinatorially NP hard probably)

I have a reasonable-sized set of size N (say 10 000 objects) in which I am searching for groups of compatible elements. Meaning that I have a function y = f(x_1, x_2, x_3, ..., x_n) returning bool 0/1 answer whether n elements are compatible. We are…
0
votes
2 answers

Does it make sense to randomly select features as a baseline?

In my paper, I am saying that the accuracy of classification is $x\%$ when using the top N features. My supervisor thinks that we should capture the classification accuracy when using N randomly selected features to show that the initial feature…
Sterls
  • 160
  • 1
  • 7