Related to the development of software tools or platforms for data science topics. This includes APIs, open-source, visualization, modeling, mathematics, applications, etc.
Questions tagged [software-development]
17 questions
10
votes
2 answers
Software Testing for Data Science in R
I often use Nose, Tox or Unittest when testing my python code, specially when it has to be integrated with other modules or other pieces of code. However, now that I've found myself using R more than python for ML modelling and development. I…
wacax
- 3,500
- 4
- 26
- 48
6
votes
5 answers
Can Machine Learning be applied in software developement
I'm from programming background. I'm now learning Analytics. I'm learning concepts from basic statistics to model building like linear regression, logistic regression, time-series analysis, etc.,
As my previous experience is completely on…
user4290511
- 101
- 6
5
votes
1 answer
How to Deploy your trained ML model in client VM without they getting access to code
I am new to deployment and have a basic doubt about deploying my ML code on client's vm.
So I have built a python project which collects data from client site, processing, predicts and displays the result in dashboard. I have to use client VMs for…
chink
- 565
- 9
- 19
4
votes
1 answer
How to access maximum volume of tweets using Twitter Streaming API, without firehose access?
Twitter is a popular source of data for many applications, especially involving sentiment analysis and the like. I have some things I'm interested in doing with Twitter data, but here's the issue: To get all Tweets, you have to get special…
mindcrime
- 211
- 1
- 7
3
votes
2 answers
From development environment to production
I have been working on a project as part of my master degree in participation with a firm.
I developed a predictive model in the past few months that is essentially a document classification model. The biggest limitation of the research and model…
OAK
- 131
- 3
2
votes
0 answers
Is there a corpus of toy datasets specifically designed for finding bugs in data science software?
I'm looking for a corpus of toy tabular datasets that can be used to test data profiling, machine learning, data manipulation, etc. software. Some example attributes:
Strange column names (empty string, long names, duplicate names, names with…
Shoeboxam
- 21
- 2
2
votes
2 answers
Big data and data mining for CRM?
We are currently developing a customer relationship management software for SME's. What I'd like to structure for our future CRM is developing CRM with a social-based approach (Social CRM). Therefore we will provide our users (SME's) to integrate…
latefreak
- 23
- 2
2
votes
0 answers
Diagramming data science workflow?
I'm working on a consulting project for a tech client, and caught myself scratching my head about what the best way to present advanced analytics workflow is. What will be shown to the panel will focus on results, but in this particular case it is…
Alex Firsov
- 131
- 6
1
vote
1 answer
unit-testing Machine Learning models
I have been asked to unit-test my machine learning model(not the code that made the model). Since we wouldn't actually know what predictions models make, how to carry out the unit-testing to check the model's predictions against? How is this…
Naveen Reddy Marthala
- 325
- 2
- 16
1
vote
0 answers
Python, Tkinter, Application packaging and distributing with custom packages
I have built an application on Tkinter in Python 3 and I want to package that application with all the dependent packages.
I would want to build the .exe application of my python script that installs python 3, some of the packages/dependencies, and…
Raj Mehta
- 129
- 4
1
vote
1 answer
Visualizing software metrics
I have the below sets of data per application, you can call them as software metrics. These metrics vary depending on the size of an application.
Bugs
CodeSmells
Vulnerability
The size of the application is determined by LOC (Lines of code), how…
Sudheej
- 113
- 4
1
vote
0 answers
I want to learn how to construct data science packages on top of core packages. Is there a list of excellent data science packages I can learn from?
Short question
I want to learn how to construct data science packages on top of core packages. Is there a list of excellent data science packages I can learn from?
Long question
I recently came across an excellent video where Joel Grus live codes a…
myopic
- 11
- 3
1
vote
0 answers
Machine Learning Model Input and Output Flow
I am working on a backend for structuring and submitting data into a ML model. I have 3 questions regarding this process.
What is the best method to feed the model continuous data (updated every 30m interval)
What is the best method to deliver the…
Brandon
- 11
- 1
0
votes
1 answer
Surface Pro 6 vs Macbook Pro for Professional Data Science Practice
[I strongly agree this is totally very opinionated question, thus narrators feel free to vote to close it if you feel it is right, but I find endless pros and cons on the Internet, I've decided to ask the community here.]
Surface Pro 6 or Macbook…
TwinPenguins
- 4,429
- 3
- 22
- 54
0
votes
2 answers
Keep track of trainings, datasets eetc
After searching quite some time for it on Google I could not find a sufficient software/toolbox that can manage trainings of neural networks. I thought of a program that combines visualization techniques without the need to write code as well as…
Lukas
- 11
- 1