Questions tagged [feature-reduction]
9 questions
2
votes
0 answers
Reduce number of vectors in dataset to achieve the "same average dimensions result"?
Edit for re-opening the question, I'll try to answer questions made by @user2974951:
I have a large user preference statistics for trichotomic data sets. You can visualize each data trio as a 3D vector with X, Y and Z values. All vectors complies to…
Sanxofon
- 121
- 4
1
vote
0 answers
Information compression for variable input size
Is there a way to compress information of a variable input size?
Autoencoder requires standardized input sizes. Although I can add masks on the cost function and add dummy features to standardize input/output size, I am hesitant with the potential…
Simon
- 111
- 1
1
vote
3 answers
Correlation Matrix for non-numeric features
Currently, I have dataset with numeric as well non-numeric attributes. I am trying to remove the redundant features in the dataset using R Programming Languages. Note: Non-numeric attributes cannot be turned into binary.
The Caret R package…
ray mai
- 11
- 1
- 2
1
vote
0 answers
Does it exist feature selection/reduction techniques in $O(n \cdot d)$?
I'm curious to know if feature selection and/or feature reduction techniques exist, which are linear on number of data $n$ and on number of dimensions $d$.
References and source code are very welcome.
KyBe
- 410
- 3
- 13
0
votes
2 answers
Using PCA as features for production
I struggle with figuring out how to proceed with taking PCA into production in order to test my Models with unknown samples.
I'm using both an One-Hot-Encoding an an TF-IDF in order to classify my elements with various models, mainly KNN.
I know i…
Humpalum Druf
- 11
- 1
0
votes
2 answers
Feature reduction by removing certain columns in dataframe
I am working with the Emotion recognition model with the IEMOCAP dataset. For the feature extraction, I am taking mel-spectrogram and then convert it into a NumPy array and converting the array into a data frame of spectrogram features.
The…
adikh
- 101
- 1
0
votes
0 answers
How should I encode 'dynamic' features (with multiple instances) along with 'static' features (single instances)?
Suppose I have to predict if a certain product from an assembly line in a factory will be a scrap. This product has let's say 'static' data like a certain shape. A certain vendor, etc. And, it can have 'dynamic' data this meaning it can have for…
Toma Dragos
- 1
- 1
0
votes
1 answer
Heuristics, methods to speed up searches over subsets of big set (combinatorially NP hard probably)
I have a reasonable-sized set of size N (say 10 000 objects) in which I am searching for groups of compatible elements. Meaning that I have a function y = f(x_1, x_2, x_3, ..., x_n) returning bool 0/1 answer whether n elements are compatible. We are…
0
votes
2 answers
Does it make sense to randomly select features as a baseline?
In my paper, I am saying that the accuracy of classification is $x\%$ when using the top N features.
My supervisor thinks that we should capture the classification accuracy when using N randomly selected features to show that the initial feature…
Sterls
- 160
- 1
- 7