Questions tagged [data-indexing-techniques]

Data is central to computer science and technology. To store data efficiently and retrieve it for reuse, we rely on a family of techniques known collectively as indexing. This tag covers those techniques, their efficiency, and anything related to accessing stored data through an index.

9 questions
10
votes
1 answer

What is the most efficient data indexing technique

As we all know, there are several data indexing techniques used by well-known indexing tools, such as Lucene (for Java) or Lucene.NET (for .NET), MurmurHash, B+ trees, etc. For a NoSQL / object-oriented database (which I am trying to write and play around with a little…
3
votes
1 answer

Why do the original values go missing when using the reindex method on a DataFrame?

This is the original DataFrame: What I wanted: to convert the above DataFrame into this DataFrame with multi-indexed columns: I managed to do it with this piece of code: # tols : original dataframe cols =…
Arnav Das
  • 132
  • 1
  • 8
2
votes
0 answers

Does statsmodels fully support MultiIndex?

The code snippet below shows how statsmodels seems to flatten MultiIndex tuples by joining them with an underscore "_". import numpy as np import pandas as pd from statsmodels.regression.linear_model import OLS K = 2 N = 10 ERROR_VOL =…
OldSchool
  • 251
  • 1
  • 2
  • 8
2
votes
0 answers

Spatial index for variable kernel nonparametric density

I'm trying to build a nonparametric density function for a fairly large dataset that can be evaluated efficiently, and can be updated efficiently when new points are added. There will only ever be a maximum of 4 independent variables, but we can…
user1158559
  • 151
  • 3
1
vote
0 answers

Book indexing data science project

Is it possible to perform book index searching using machine learning algorithms? Inputs: (1) book pages, with page numbers, as images; (2) the index words in the book. Output: the page number(s) on which each provided index word appears.
1
vote
1 answer

Index for efficient argmax(w.x) query ~ 20d

I'm looking for a spatial index that can efficiently find the most extreme n points in a certain direction, i.e. for a given w, find x[0:n] in the dataset where x[0] gives the largest value of w.x, x[1] the second largest value of w.x, and so on. Is…
user1158559
  • 151
  • 3
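
For reference, the query described in this question can be pinned down with a brute-force baseline. This is only a sketch of what the query returns, not the spatial index being asked for: it scans every point, so each query costs a full pass over the dataset. The function name `top_n_by_projection` and the sample data are hypothetical.

```python
import heapq

def top_n_by_projection(points, w, n):
    """Return the n points with the largest dot product w.x, largest first."""
    def dot(x):
        return sum(wi * xi for wi, xi in zip(w, x))
    # heapq.nlargest keeps only n candidates in memory while scanning all points.
    return heapq.nlargest(n, points, key=dot)

# Hypothetical 2-d sample data; the question targets roughly 20 dimensions.
points = [(1.0, 0.0), (0.0, 2.0), (3.0, 1.0), (-1.0, -1.0)]
w = (1.0, 1.0)
top2 = top_n_by_projection(points, w, 2)
```

Any index proposed as an answer should return the same points as this scan, which makes the baseline useful as a correctness check.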
1
vote
1 answer

The difference between Faiss Index and a Database Index

An index points to data in a table. In a database, indexes are similar to those in books. I am a little confused about what an index means in the Faiss library and how it differs from a database index. Could someone explain?
Avv
  • 231
  • 2
  • 9
1
vote
0 answers

How to solve this IndexError?

I have created a training DataFrame, Traindata, as follows: dataFile='/content/drive/Colab Notebooks/.../Normal_Anomalous_8Digits.csv' data8=pd.read_csv(dataFile) And Traindata looks like the following: Here Output is the predicted variable, which is…
0
votes
1 answer

Looking for a 'CITY, STATE' within a body of text (from a CITY-STATE database)

I'm looking for an optimal way to search a large body of text for a combination of words that resemble any CITY, STATE combination I have in a separate CITY-STATE database. My only idea would be to do a separate search against the body of text for…
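
One possible alternative to running a separate search per database entry is to invert the problem: scan the text once for anything shaped like "City, ST" and look each candidate up in a set. The sketch below assumes two-letter state codes and capitalized city names of up to three words; the `CITY_STATES` set is a hypothetical stand-in for the CITY-STATE database, and the function name is illustrative.

```python
import re

# Hypothetical stand-in for the CITY-STATE database: (lowercase city, state code).
CITY_STATES = {("austin", "TX"), ("new york", "NY"), ("salt lake city", "UT")}

# Candidate: one to three capitalized words, a comma, then a two-letter code.
CANDIDATE = re.compile(r"\b((?:[A-Z][a-z]+ ){0,2}[A-Z][a-z]+), ([A-Z]{2})\b")

def find_city_states(text):
    """Return (city, state) pairs from CITY_STATES that appear in text."""
    matches = []
    for m in CANDIDATE.finditer(text):
        words = m.group(1).split()
        state = m.group(2)
        # Try the longest candidate first, then drop leading words, so that
        # "Visit New York, NY" still resolves to "new york".
        for start in range(len(words)):
            city = " ".join(words[start:]).lower()
            if (city, state) in CITY_STATES:
                matches.append((city, state))
                break
    return matches

text = "She moved from Austin, TX to New York, NY. Visit Salt Lake City, UT soon."
found = find_city_states(text)
```

The cost is one pass over the text plus a constant-time set lookup per candidate, rather than one text search per city. Fuzzy matches (misspellings, full state names) would need a different candidate pattern or a normalization step.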