Questions tagged [programming]

38 questions
61
votes
10 answers

IDE alternatives for R programming (RStudio, IntelliJ IDEA, Eclipse, Visual Studio)

I use RStudio for R programming. I remember about solid IDE-s from other technology stacks, like Visual Studio or Eclipse. I have two questions: What other IDE-s than RStudio are used (please consider providing some brief description on them). Does…
IgorS
  • 5,444
  • 11
  • 31
  • 43
46
votes
12 answers

Data Science in C (or C++)

I'm an R language programmer. I'm also in the group of people who are considered Data Scientists but who come from academic disciplines other than CS. This works out well in my role as a Data Scientist, however, by starting my career in R and only…
Hack-R
  • 1,919
  • 1
  • 21
  • 34
11
votes
6 answers

Is Excel sufficient for data science?

I'm in the process of preparing to teach an introductory course on data science using the R programming language. My audience is undergraduate students majoring in business subjects. A typical business undergrad does not have any computer…
I Like to Code
  • 267
  • 2
  • 5
9
votes
2 answers

How prevalent is `C/C++` in machine learning development?

I am currently a data scientist mostly doing NLP, and I do most of my work inPython. Since I didn't get a CS degree in undergrad, I've been limited to very high level languages; Java, Python, and R. I somehow even took Data Structures and Algorithms…
gust
  • 237
  • 1
  • 6
4
votes
3 answers

Bidirectional Encoder Representations from Transformers in R

Can anybody suggest to me, where I can find example code for R language for BERT neural network for text mining tasks. All I can see are python examples, and I need…
Kogan
4
votes
1 answer

Practical use of oop in R

R supports a wide range of OOP designs like s3,s4,RC and others via packages,and it's a bit overwhelming to decide on which to use and a more basic question that I have is when and where do you use OOP while doing machine learning or data…
ultron
  • 143
  • 1
  • 1
  • 7
3
votes
1 answer

How to choose tools for web dashboard?

I am working in a small sales company. Started with making relatively easy, small, static reports in Excel, but then it was required to make a lot of highly-customizable and adaptable reports, that are really troublesome to do in Excel. I do a lot…
Jay Shee
  • 31
  • 1
3
votes
1 answer

How to implement a Restricted Boltzmann Machine manually?

I am learning about Restricted Boltzmann Machines and I'm so excited by the ability it gives us for unsupervised learning. The problem is that I do not know how to implement it using one of the programming languages I know without using libraries. I…
stressed out
  • 175
  • 1
  • 8
3
votes
1 answer

What can functional programming be used for in data science?

In my next academic year at university, I have the option to take a course in Advanced Functional Programming. A basic description of the course is this: "You’ll focus on a number of more advanced functional programming topics such as: programming…
Data
  • 467
  • 3
  • 11
2
votes
2 answers

Divide a column by itself with mutate_at dplyr

Hi I'd like to turn each non zero value of my selected columns to a 1 using mutate_at() BRAND MEDIA_TYPE INV1 INV2 b1 newspapers 2 27 b1 magazines 3 0 b2 …
3nomis
  • 531
  • 6
  • 17
2
votes
3 answers

Tell me some things a data scientist/statistician needs to do that can't be done in R/Stata/SAS/SPSS

Why do I need to learn to program in python or Java and learn data structures? I can't think of a single thing statisticians/data scientists need to do that isn't built in as a package in R/stata.
user64141
2
votes
2 answers

Best Programming Language for Data Science

I'm learning JS, HTML and CSS, but I doubt JS is very good at Data Analysis. So, what would you guys recommend me learning to start my "career" in Data Science? What's the best programming language for processing data? P.S. I love statistics and…
Ben
  • 123
  • 3
2
votes
3 answers

How to automate left join of multiple data frames with single data frame one by one in R

i have 206 data frames named after country(FRANCE,GERMANY,HONGKONG...) with same 37 variables in these 206 files and another data frame name pf with 9 variables. i want to do left join of these 206 data frames with PF data frame one by one…
joy
  • 61
  • 2
  • 6
2
votes
1 answer

Generating a random numpy ndarray of 0 and 1 with a specific range of 1 values

I want to generate a random numpy ndarray of 0 and 1. I want that the number of occurrences of 1 in every specific rowto range between 2 and 5. I tried:x = np.random.randint(0,2, size=(10, 10))yet i cannot control the number of ones. I tried the np.…
I Sui
  • 57
  • 3
2
votes
1 answer

cbind doesn't attribute the name of the column

I'm trying to create a new data frame that contains the row names and the first column of an existing data : i tried this #To take the rownames of my old data New_data <- as.data.frame(row.names(old_data)) #To add the first column of my old data to…
Anouar
  • 23
  • 2
1
2 3