Yahoo Web Search

Search results

  1. Callum McDougall. Second Unit Director or Assistant Director. Production Manager. Additional Crew. IMDbPro Starmeter See rank. Callum McDougall is known for 1917 (2019), Mary Poppins Returns (2018) and Spectre (2015). More at IMDbPro. Contact info. Agent info.

    • Callum McDougall
  2. Callum McDougall is known as an Executive Producer, Unit Production Manager, Actor, Producer, Second Assistant Director, Co-Producer, Third Assistant Director, and Production Manager. Some of his work includes Harry Potter and the Prisoner of Azkaban, Skyfall, 1917, Spectre, Quantum of Solace, The Beach, Into the Woods, and Wrath of the Titans.

    • Overview
    • Chapter 0: Fundamentals
    • Chapter 1: Transformers & Mech Interp
    • Chapter 2: Reinforcement Learning
    • Chapter 3: Training at Scale

    This GitHub repo hosts the exercises and Streamlit pages for the ARENA 2.0 program.

    You can find a summary of each of the chapters below. For more detailed information (including the different ways you can access the exercises), click on the links in the chapter headings.

    The material on this page covers the first five days of the curriculum. It can be seen as a grounding in all the fundamentals necessary to complete the more advanced sections of this course (such as RL, transformers, mechanistic interpretability, and generative models).

    Some highlights from this chapter include:

    •Building your own 1D and 2D convolution functions

    •Building and loading weights into a Residual Neural Network, and finetuning it on a classification task

    •Working with weights and biases to optimise hyperparameters

    •Implementing your own backpropagation mechanism

    The material on this page covers the next 8 days of the curriculum. It will cover transformers (what they are, how they are trained, how they are used to generate output) as well as mechanistic interpretability (what it is, what are some of the most important results in the field so far, why it might be important for alignment).

    Some highlights from this chapter include:

    •Building your own transformer from scratch, and using it to sample autoregressive output

    •Using the TransformerLens library developed by Neel Nanda to locate induction heads in a 2-layer model

    •Finding a circuit for indirect object identification in GPT-2 small

    •Intepreting model trained on toy tasks, e.g. classification of bracket strings, or modular arithmetic

    Reinforcement learning is an important field of machine learning. It works by teaching agents to take actions in an environment to maximise their accumulated reward.

    In this chapter, you will be learning about some of the fundamentals of RL, and working with OpenAI’s Gym environment to run your own experiments.

    Some highlights from this chapter include:

    •Building your own agent to play the multi-armed bandit problem, implementing methods from Sutton & Bardo

    •Implementing a Deep Q-Network (DQN) and Proximal Policy Optimization (PPO) to play the CartPole game

    •Applying RLHF to autoregressive transformers like the ones you built in the previous chapter

    With the advent of large language models, training at scale has become a necessity to create highly competent models. In this chapter we will go through the basics of GPUs and distributed training, along with introductions to libraries that make training at scale easier.

    Some highlights from this chapter include:

    •Quantizing your model to INT8 for blazing fast inference

    •Implementing distributed training loops using torch.dist

  3. Interpretability Researcher at Anthropic. Director of the ARENA program. 130 followers · 0 following. London. https://www.perfectlynormal.co.uk/ Achievements. Beta Send feedback. Highlights. Pro. Block or Report. Popular repositories. ARENA_2.0 Public. Resources for skilling up in AI alignment research engineering.

  4. Oct 1, 2022 · CALLUM MCDOUGALL. Hello! This is my blog, which I use to talk about things that interest me. I am one or more of the following: AI Safety Researcher, Film & Music Enjoyer, Fudge Baker, Thread-Art Maker , Hitchhiker's Guide To The Galaxy Fan, Occasional Runner, Clothespeg Carrier , Amateur Front-End Developer, Bad Pun-Maker,

  5. Experience: Anthropic · Education: University of Cambridge · Location: Clapham · 500+ connections on LinkedIn. View Callum McDougalls profile on LinkedIn, a professional community of 1...

  6. www.arena.education › teamTeam — ARENA

    Callum McDougall is a Cambridge MMath graduate and a researcher in AI safety. He founded ARENA, a bootcamp for aspiring AI safety researchers, and designed its curriculum.