Questions tagged [h2o]

H2O is an in-memory platform for distributed, scalable machine learning. H2O uses familiar interfaces like R, Python, Scala, Java, JSON and the Flow notebook/web interface, and works seamlessly with big data technologies like Hadoop and Spark. H2O provides implementations of many popular algorithms such as GBM, Random Forest, Deep Neural Networks, Word2Vec and Stacked Ensembles. It is an open source project maintained by H2O.ai (formerly known as 0xdata).

12 questions
2
votes
0 answers

h2o much faster than neuralnet (in R)

I’m a novice to machine learning. I've been trying out different neural network implementations in R, including the neuralnet package and the deeplearning function of the h2o package. For neuralnet, the default setting is one hidden layer with one…
user110645
  • 31
  • 1
2
votes
1 answer

How to extract the sample split (values) of decision tree leaves ( terminal nodes) applying h2o library

Sorry for a long story, but it is a long story. :) I am using the h2o library for Python to build a decision tree and to extract the decision rules out of it. I am using some data for training where labels get TRUE and FALSE values. My final goal is…
Sapiens
  • 21
  • 3
2
votes
0 answers

multi class classification : unbalanced data - good testing results poor prediction results

I have unbalanced dataset with 11 classes where 1 one class is 30% and rest are between 5-12%. I am not a hardcore programmer so I am using the product from https://www.h2o.ai/. I used GBM and DRF and used the option to balance the classes and the…
1
vote
0 answers

Which loss functions does h2o.gbm use by default?

the GBM implementation of the h2o package only allows the user to specify a loss function via the distribution argument, which defaults to multinomial for categorical response variables and gaussian for numerical response variables. According to…
user111690
  • 11
  • 1
1
vote
1 answer

Running H2O in databricks

I am trying to run H2O in databricks. However, when I do the following: hc = pysparkling.H2OContext.getOrCreate(spark) I get the following error: java.lang.AbstractMethodError Does anyone know what the problem could be?
physics_2015
  • 129
  • 2
  • 8
0
votes
1 answer

Which decision tree algorithm does H2O use?

Does H2O's plain random forest use CART, C4.5, 5.0, or something else? I cannot find this information. sklearn's docs say they use a modified version of CART, and I assume H2O also uses something like CART.
0
votes
0 answers

How can I prevent overfitting?

hope to find you well ! I am trying to build a model to classiffy customers with propensity to buy, but i cannot get rid of overfitting! My approach is the following: I have created the train dataset with unbalanced approach and have now a target 1…
0
votes
1 answer

H2O deep learning model performance

I am discovering H2O deeplearning and I would like to have your point of view about the performance that's performed my model on classification problem. Do you think my model is overfitting? dl_fit2 <- h2o.deeplearning(x = predictors, y = response, …
user979974
  • 137
  • 5
0
votes
1 answer

Modelling in python and scoring in MATLAB?

I have model objects either pickled object or H2O POJO. Is it possible to call those objects and do the scoring in MATLAB?
mlee_jordan
  • 153
  • 1
  • 8
0
votes
0 answers

H20 AutoML Parallelism

Does H2O AutoML parallelize different models when launched via train ? (or you should specify it somehow?) If so, can you show an example ?
0
votes
0 answers

why is H2O using only a part of the data?

I have this dataframe: > head(df_clas_sn) country serial_no_of_generator_1 serial_no_of_generator_2 serial_no_of_generator_3 unit_type 11 Germany XY 01 0620 ORiP 12 India …
Ben
  • 510
  • 5
  • 13
0
votes
1 answer

AutoML for categorical feature encoding

I have an input dataset with more than 100 variables where around 80% of the variables are categorical in nature. While some variables like gender, country etc can be one-hot encoded but I also have few variables which have an inherent order in…
The Great
  • 2,525
  • 16
  • 40