Suggest edit — 17-08-07 Deep Problems with Machine Learning

Title

Name

Note

---
title: "17-08-07 Deep Problems with Machine Learning"
visibility: public
---

# 17-08-07 Deep Problems with Machine Learning

Category: [[idea-lists-upon-request|Idea Lists (Upon Request)]]

[Read the original document](https://docs.google.com/document/d/1bI-fCV2uoxzfyyVo0cOaSm_OPZ1x9QPrfA6lG-Pgf58/edit?usp=drivesdk&sa=D&ust=1596495076912000&usg=AOvVaw1w5rLociKOxVln3dQy0__1)

---

1. Overemphasis on unsupervised learning due to conflation between humans giving labels to data manually and the math of supervised learning
   1. Prediction of future input states hierarchically across time allow for model based, common-sense aware counterfactual reasoning for planning and causal learning
2.  The fact that gradient boosting based on decision trees, which merely capture discontinuities without capturing continuous structure or being able to generalize non-locally, are state of the art on ~all datasets that don’t have compositional structure is a strong indictment of the state of machine learning.
3. Representations in NLP are the heuristic result of thinking about context / co-occurrence
4. Transfer learning is stuck on homogeneous data (pixels, frequencies, words)
   1. Absence of metadata-informed transfer learning over heterogeneous data
5. Representations are undissected, and so transfer is crude
6. Lack of emphasis on causal reasoning - Anticausal problems everywhere
7. RL doesn’t model the world (model free)
8. Model Free Natural Language Processing - doesn’t model the underlying cause of language, just the language itself
9. Data is framed in 2D, vector = datapoint style
   1. Inability to do hierarchical learning
10. Algorithms input paradigm is fixed
11. The data we collect is a function of the data we anticipate we can use, which depends on our tools
12. Lack of quality benchmarks outside of Vision (say, NLP and Babi tasks)
13. Time series data representation is pigeonholed into 2D data representation because of the way we structured our algorithms
14. The fact that evolutionary methods are close in performance to reinforcement learning is a strong indictment of our ability to learn in those environments.

---

*Source: [Original Google Doc](https://docs.google.com/document/d/1bI-fCV2uoxzfyyVo0cOaSm_OPZ1x9QPrfA6lG-Pgf58/edit?usp=drivesdk&sa=D&ust=1596495076912000&usg=AOvVaw1w5rLociKOxVln3dQy0__1)*