Yoshua Bengio - The Consciousness Prior (2017)
History /
Edit /
PDF /
EPUB /
BIB /
Created: September 28, 2017 / Updated: November 2, 2024 / Status: finished / 3 min read (~423 words)
Created: September 28, 2017 / Updated: November 2, 2024 / Status: finished / 3 min read (~423 words)
- Represent conscious states as low-dimensional vectors
- Conscious states are taken from high-dimensional unconscious states
- It is likely that some form of RNN will be used to learn how to extract the conscious states from the unconscious state
- Consciousness as defined by Locke: consciousness is "the perception of what passes in a man's own mind", or awareness of an external object or something within oneself
- Let st be the observed state at time t and let ht be the high-level representation derived from st (and from past observed values st−k in the partially observable case)
ht=F(st,ht−1)
- The conscious state ct is defined as a very low-dimensional vector which is derived from ht by a form of attention mechanism applied on ht, taking into account the previous conscious state as context
ct=C(ht,ct−1,zt)
where zt is a random noise source
- The cognitive interpretation is that the value of ct corresponds to the content of a thought, a very small subset of all the information available to us unconsciously, but which has been brought to our awreness by a particular form of attention which picks several elements or projections from ht
- The function C is the consciousness RNN and because of its random noise inputs, produces a random choice of the elements on which the attention gets focused
- To capture the assumption that a conscious thought can encapsulate a statement about the future, we introduce a verifier network which can match a current representation state ht with a past conscious state ct−k:
V(ht,ct−k)∈R
- V(ht,ct−k) indicates the consistency of ct−k with ht, e.g., estimating the probability of the corresponding statement being true, given ht
- We would like to define an objective (or reward) function which embodies the idea that the attended (conscious) elements are used in some way whose value can be quantified and optimized
- There are two distinct mechanisms at play which contribute to map the high-level state representation to the objective function
- the attention mechanism which selects and combines a few elements from the high-level state representation into a low-dimensional "conscious sub-state" object
- the predictions or actions which are derived from the sequence of these conscious sub-states
- Bengio, Yoshua. "The Consciousness Prior." arXiv preprint arXiv:1709.08568 (2017).