Charles H. Martin, PhD’s Post

View profile for Charles H. Martin, PhD, graphic

AI Specialist and Distinguished Engineer (NLP & Search). Inventor of weightwatcher.ai . TEDx Speaker. Need help with AI ? #talkToChuck

Why alpha=2 is the ideal state of a NN layer ? In our upcoming monograph, A SemiEmpirical Theory of (Deep) Learning, we show that the weightwatcher HTSR metrics can be derived as a phenomenological Effective Hamiltonian, but one that is governed by a scale-invariant partition function, just like the scale invariance in the Wilson Renormalization Group (RG). And, of course, the RG equations apply near or at the critical point or phase boundary, and are characterized by a critical, universal power-law exponent. And this is exactly what we observe empirically. Both the universal exponent, alpha=2, *and* the signature of scale invariance (the detX condition). And you can observe it too! pip install weightwatcher import weightwatcher as ww watcher = ww.WeightWatcher(model='your model or model folder') details = watcher.analyze(plot=True, detX=True) Check out the blog for more details on how you yourself can check for scale-invariant behavior in your own NN layers: https://lnkd.in/gRnDJzg3 Want to learn more ? Check out: https://weightwatcher.ai and if you want to see the 'proof' or just learn more about weightwatcher, join our Community Discord server. #talkToChuck #theAIguy

  • graphical user interface, text, application, email
David Pierce

AI/ML SME & Technologist w/ 15+ yrs of Expertise in Automation, Data Utilization & Risk Mitigation @ Fortune-X Companies

3w

Love that your work also predicted the early stopping necessary for highly compressed models given that double descent curve and less ‘wiggle room’ for going from grok to catastrophic forgetting. It was a pleasure getting to read the latest pre-print! Wishing you well Charles! ✅🙏📊

To view or add a comment, sign in

Explore topics