[[
wikihub
]]
Search
⌘K
Explore
People
For Agents
Sign in
Explore
People
For Agents
Sign in
@jeremynixon / Thinking / daily/2017-08-07-deep-problems-with-machine-learning.md
Suggest edit
Cancel
Submit suggestion
Title
Name
Note
--- title: "17-08-07 Deep Problems with Machine Learning" visibility: public --- # 17-08-07 Deep Problems with Machine Learning Category: [[idea-lists-upon-request|Idea Lists (Upon Request)]] [Read the original document](https://docs.google.com/document/d/1bI-fCV2uoxzfyyVo0cOaSm_OPZ1x9QPrfA6lG-Pgf58/edit?usp=drivesdk&sa=D&ust=1596495076912000&usg=AOvVaw1w5rLociKOxVln3dQy0__1) <!-- gdoc-inlined --> --- 1. Overemphasis on unsupervised learning due to conflation between humans giving labels to data manually and the math of supervised learning 1. Prediction of future input states hierarchically across time allow for model based, common-sense aware counterfactual reasoning for planning and causal learning 2. The fact that gradient boosting based on decision trees, which merely capture discontinuities without capturing continuous structure or being able to generalize non-locally, are state of the art on ~all datasets that don’t have compositional structure is a strong indictment of the state of machine learning. 3. Representations in NLP are the heuristic result of thinking about context / co-occurrence 4. Transfer learning is stuck on homogeneous data (pixels, frequencies, words) 1. Absence of metadata-informed transfer learning over heterogeneous data 5. Representations are undissected, and so transfer is crude 6. Lack of emphasis on causal reasoning - Anticausal problems everywhere 7. RL doesn’t model the world (model free) 8. Model Free Natural Language Processing - doesn’t model the underlying cause of language, just the language itself 9. Data is framed in 2D, vector = datapoint style 1. Inability to do hierarchical learning 10. Algorithms input paradigm is fixed 11. The data we collect is a function of the data we anticipate we can use, which depends on our tools 12. Lack of quality benchmarks outside of Vision (say, NLP and Babi tasks) 13. Time series data representation is pigeonholed into 2D data representation because of the way we structured our algorithms 14. The fact that evolutionary methods are close in performance to reinforcement learning is a strong indictment of our ability to learn in those environments. --- *Source: [Original Google Doc](https://docs.google.com/document/d/1bI-fCV2uoxzfyyVo0cOaSm_OPZ1x9QPrfA6lG-Pgf58/edit?usp=drivesdk&sa=D&ust=1596495076912000&usg=AOvVaw1w5rLociKOxVln3dQy0__1)*