Yoshua Bengio - The Consciousness Prior (2017)

History / Edit / PDF / EPUB / BIB /
Created: September 28, 2017 / Updated: November 2, 2024 / Status: finished / 3 min read (~423 words)
Machine learning

  • Represent conscious states as low-dimensional vectors
  • Conscious states are taken from high-dimensional unconscious states
  • It is likely that some form of RNN will be used to learn how to extract the conscious states from the unconscious state

  • Consciousness as defined by Locke: consciousness is "the perception of what passes in a man's own mind", or awareness of an external object or something within oneself

  • Let st be the observed state at time t and let ht be the high-level representation derived from st (and from past observed values stk in the partially observable case)

    ht=F(st,ht1)

  • The conscious state ct is defined as a very low-dimensional vector which is derived from ht by a form of attention mechanism applied on ht, taking into account the previous conscious state as context

    ct=C(ht,ct1,zt)

    where zt is a random noise source

  • The cognitive interpretation is that the value of ct corresponds to the content of a thought, a very small subset of all the information available to us unconsciously, but which has been brought to our awreness by a particular form of attention which picks several elements or projections from ht
  • The function C is the consciousness RNN and because of its random noise inputs, produces a random choice of the elements on which the attention gets focused

  • To capture the assumption that a conscious thought can encapsulate a statement about the future, we introduce a verifier network which can match a current representation state ht with a past conscious state ctk:

    V(ht,ctk)R

  • V(ht,ctk) indicates the consistency of ctk with ht, e.g., estimating the probability of the corresponding statement being true, given ht
  • We would like to define an objective (or reward) function which embodies the idea that the attended (conscious) elements are used in some way whose value can be quantified and optimized
  • There are two distinct mechanisms at play which contribute to map the high-level state representation to the objective function
    • the attention mechanism which selects and combines a few elements from the high-level state representation into a low-dimensional "conscious sub-state" object
    • the predictions or actions which are derived from the sequence of these conscious sub-states

  • Bengio, Yoshua. "The Consciousness Prior." arXiv preprint arXiv:1709.08568 (2017).