Questions tagged [definitions]

a discussion (meta) tag used when there exists disagreement or confusion about the everyday meaning of a term or phrase.

Use the tag when:

You think we should create an official definition.

An existing Tag Wiki needs a more precise definition to avoid confusion and we need to create consensus before an edit.

(rough draft - needs filling out)

40 questions
26
votes
4 answers

Is Data Science the Same as Data Mining?

I am sure data science as will be discussed in this forum has several synonyms or at least related fields where large data is analyzed. My particular question is in regards to Data Mining. I took a graduate class in Data Mining a few years back. …
demongolem
  • 413
  • 5
  • 10
24
votes
3 answers

Starting my career as Data Scientist, is Software Engineering experience required?

I am an MSc student at the University of Edinburgh, specialized in machine learning and natural language processing. I had some practical courses focused on data mining, and others dealing with machine learning, bayesian statistics and graphical…
cpumar
  • 815
  • 1
  • 10
  • 14
16
votes
3 answers

Parallel and distributed computing

What is(are) the difference(s) between parallel and distributed computing? When it comes to scalability and efficiency, it is very common to see solutions dealing with computations in clusters of machines, and sometimes it is referred to as a…
Rubens
  • 4,117
  • 5
  • 25
  • 42
13
votes
4 answers

What is the difference between outlier detection and anomaly detection?

I would like to know the difference in terms of applications (e.g. which one is credit card fraud detection?) and in terms of used techniques. Example papers which define the task would be welcome.
Martin Thoma
  • 19,540
  • 36
  • 98
  • 170
11
votes
3 answers

Data Science oriented dataset/research question for Statistics MSc thesis

I'd like to explore 'data science'. The term seems a little vague to me, but I expect it to require: machine learning (rather than traditional statistics); a large enough dataset that you have to run analyses on clusters. What are some good…
9
votes
1 answer

Original Meaning of "Intelligence" in "Business Intelligence"

What does the term "Intelligence" originally stand for in "Business Intelligence" ? Does it mean as used in "Artificial Intelligence" or as used in "Intelligence Agency" ? In other words, does "Business Intelligence" mean: "Acting smart &…
8
votes
2 answers

Why is finite precision a problem in machine learning?

Can you explain what is finite precision? Why is finite precision a problem in machine learning?
7
votes
5 answers

How is Data Science related to Machine learning?

I went through this comparison of analytic disciplines and this perspective of machine learning, but I am not finding any answers on the following: How is Data Science related to Machine learning? How is it not related to Machine Learning?
Subham Tripathi
  • 189
  • 1
  • 5
6
votes
1 answer

How to define a custom resampling methodology

I'm using an experimental design to test the robustness of different classification methods, and now I'm searching for the correct definition of such design. I'm creating different subsets of the full dataset by cutting away some samples. Each…
gc5
  • 879
  • 2
  • 9
  • 17
6
votes
3 answers

What is an alternative name for "Unstructured Data"?

I'm writing my thesis at the moment, and for some time - due to a lack of a proper alternative - I've stuck with "unstructured data" for referring to natural, free flowing text, e.g. Wikipedia articles. This nomenclature has bothered me from the…
Benjamin B.
  • 245
  • 1
  • 6
6
votes
3 answers

What is the meaning of the term "pipeline" within data science?

People often refer to pipelines when talking about models, data and even layers in a neural network. What can be meant by a pipeline?
n1k31t4
  • 15,468
  • 2
  • 33
  • 52
5
votes
2 answers

What does it mean when we say an algorithm/metric is agnostic

Problem I have all kinds of machine learning terms that co-occur with the word "agnostic", including model-agnostic learning, model-agnostic metric. From the dictionary, it explains the word "agnostic" in the following way a person who holds the…
Mr.Robot
  • 93
  • 1
  • 1
  • 6
5
votes
2 answers

Difference between Data Engineer and Data Scientist

I'm very confuse with the term Data Engineer and Data Scientist. There are lot of jobs available for both roles in current market with almost same technical skills requirement. Are they same or different ? Which role is recommended for someone from…
Ravi
  • 151
  • 3
4
votes
1 answer

What's the difference between data products and intelligent systems?

Basically, both are software systems that are based on data and algorithms.
user3643160
  • 163
  • 6
4
votes
1 answer

Network structure: k-cliques vs. p-cliques

In network structure, what is the difference between k-cliques and p-cliques, can anyone give a brief explaination with examples? Thanks in advanced! ============================ EDIT: I found an online ppt while I am googling, please take a look on…
user3663635
  • 151
  • 3
1
2 3