MMM Glossary

Here is a glossary of 100 common terms used in Marketing and Media Mix Modelling (MMM), Customer Analytics and Marketing Science:

Acquisition – using your marketing and media budget to win new customers into your business.
Activation – setting a campaign to go live and start delivering results.
Addressable media – media that goes to an individual
Ad Server – a server which hosts your digital ads so that they can be served onto the sites where you book media
AdStock – the amount of time a consumer theoretically remembers an ad for after seeing it
Artificial Intelligence (AI) – AI is a form of computer technology in which machines are trained to behave and learn like humans
Attention – a metric that seeks to capture the amount of attention a consumer pays to an ad, rather than simply seeing it. Dwell time is often used.
Attribution – the task of understanding and explaining which marketing and media channels drove which business results.
Awareness – the percentage of your target audience that is aware of your product or service
BARB – Broadcast Audience Research Board, the body that manages the measurement and reporting of TV audience data
Base – sometimes known as the intercept, or ‘constant’ this is the level of sales your business or brand generates without media or advertising spend intervention. Sometimes called Beta0.
Broad Audience – a wide demographic target audience definition such as ‘Adults’, ‘Men’, ‘Women’
Broadcast – channels that reach mass audience quickly through mass communication such as TV or radio.
Categorical variables – are descriptive rather than numeric, for example gender, country or product type
Churn – losing customers
Churn Rate – the rate at which you lose customers, usually measured as the percentage of customers lost per week, month or year
Coefficient – the “weighting factor” that the model estimates for each variable. So, we might say that for every £1 invested, 0.01 (our coefficient) sales are generated i.e. a £100 cost per sale.
Confidence Interval – The expected range in which you would expect a value to fall if that value was sampled multiple times
Consideration – the proportion of your target audience that would consider buying your product or service. Identified through tracking studies like YouGov or bespoke research.
Conversion – the rate at which visitors move to the next action in the path to purchase
Conversion Funnel – a unified view of the journey consumers takes from becoming aware of your product to purchasing and sometimes including post-purchase behaviour.
CPA – Cost per Action, or sometimes cost per acquisition
CPM – Cost per Thousand (the M stands for “mille”)
CPT – Cost per Thousand
CRM – Customer Relationship Management
Cross sectional data – data collected at a specific place or point in time, like a research survey
D2C – Direct to Consumer – selling products or services directly to consumers
Data Science – using data to establish cause and effect using the scientific method
Decay – related to AdStock – the theoretical rate at which the memory of an ad decays
Decomposition – identifying the component parts of a time series data set. For example, identifying underlying seasonality and trend components.
Demo – Demographic – usually applied to a demographic group
Dependent variable -the variable we are seeking to explain. The behaviour of the dependent variable is explained by the independent, or explanatory variables, sometimes also called the x variables.
Dimensionality – in data science, this refers to the number of x variables (columns) in relation to the number of observations (rows).
Direct Marketing – Marketing activity that seeks a measurable response at the individual consumer level
Distribution – the element of the Marketing Mix that concerns the amount of retail coverage a business has. There can also be an online manifestation of this in the form of delivery times, returns policy etc.
eCommerce – online websites that sell products and service and allow consumers to buy direct (see D2C)
Econometrics – the practice of analysing economic data with statistics. Derived from the words “economy” and “metrics”
Error term – e – the part of the model equation that captures the data that can’t be explained in the model. Sometimes also called the remainder term.
so – excess Share of Voice – the excess value when your share of voice is divided by your share of market – higher so has been shown to assist brand sales or share growth
Exploratory Data Analysis – the process of analysing a dataset to ensure that it is suitable for modelling
Effective Frequency – usually used in brand campaigns or launch campaigns. This is the number of times consumers might need to see an ad for it to register.
Feature engineering – adjusting variables such as creating interaction variables to test an effect in a model
Frequency – Sometimes called OTS (see OTS)
ggplot2 – one of the leading graphics packages in the R modelling suite.
GRP – Gross Rating Point -the percentage of your audience that sees an ad
Heteroscedasticity – a certain type of pattern (too much variance) sometimes seen in residuals – it can suggest that the model is not properly capturing all the information it has available to it
Homoscedasticity – another residuals patter where the error is constant, indicating the model is good
Impact – the actual number of consumers who see your ad
Incrementality – business outcomes that are incremental to your static state
Inference – drawing conclusions about a population based on the selection and statistical analysis of a sample from that population
Interaction variable – a variable that is created when you combine two variables to text their combined effect e.g. an increase in ad spends and a change in price
Intercept – also the base and technically the point at which the models predicted line crossed the y axis
Lag – any delay between seeing an ad and acting on it
Lifetime Value (LTV) – the amount of revenue a customer or subscriber generates over the estimated average customer lifetime. Usually between 1 and 5 years depending on the product and category.
Linear response – a non-decaying response function e.g. for every cup of coffee consumed the satisfaction score remained unchanged.
Machine Learning (ML) – is a subset of AI in which machines (computers) use data to make predictions
Marketing Science – applying the scientific method to marketing data
MMM – Marketing Mix Modelling – the technique of using statistical models to examine which variables drive business outcomes
Model – a reproducible mathematical or statistical representation of something that occurs in life
MOSAIC – A customer classification system similar to ACORN above
MRR – Monthly Recurring Revenue – the amount of money a subscriber pays per month whilst in contract or signed up to a specific service.
Noise – random changes in variables from which no or very little value can be extracted
Non-linear response – a decaying response function e.g. the more coffee a person consumes, the less satisfied they are with each additional cup, as opposed to linear response in which case each coffee would just be as satisfying as the last
OLS – Ordinary Least Squares is a regression technique that seeks to minimise the difference between actual and predicted values – related to the term ‘best fit’
OTS – Opportunities to See, a measure of frequency
Optimise – the act of reorganizing a fixed resource to produce more yield from
Optimiser – a tool that can run optimisations to resolve an allocation problem to produce maximum possible business outcomes.
Opt-In – a privacy state in which a consumer can decide whether or not to receive communications from a business.
Outlier – a value in data that is outside the upper and lower quartiles of a dataset
Panel model – a modelling technique which allows us to build models using the same variables from different data sets e.g. countries or cities
PPC – Pay Per Click – a form of search marketing
Prediction – applying your model to data to predict the value of y at x
Programmatic – a system that bids for online display space automatically subject to a set of bid rules
Prospect – a consumer who could buy but currently does not
Python – a programming language conceptually similar to R used to build models
R – a programming language (code) used to run statistical models
Reach – the percentage of your target audience that see one of your ads at least once
Recency Frequency Value (RFV) – a method for scoring and classifying customers using transactional data
Regression – a statistical technique used to find the ‘line of best fit’ relationship between two sets of variables, x and y.
Residuals – The difference between the values predicted by a model and the observed values in the data
Response Curve – The consumer response function to your media investments
Ridge Regression (sometimes called Regularised Regression) is a form of regression that scales down the influence of less important variables in a regression model
RMSE – Root Mean Squared Error – measures the average difference between values predicted by a model and the actual values in the data. A key metric in model assessment
R-Squared – Residuals squared – the total value of the residuals in a model, squared.
R Studio – (Posit) – an integrated development environment, or IDE, like a GUI to write and edit R code
Sales Funnel – a start to finish representation of your prospects’ path to purchase
Seasonality – this is not about the seasons, but about identifying repeating cycles in a time series sample – these could be other days, weeks, months or years
Share of Market (SOM) – your market shares i.e. your sales as a percentage of total category sales
Share of Voice (SOV) – your advertising spends as a share of your category’s ad spend
Spot Match – a technique for matching TV spots to web traffic using a time window, usually of a few minutes
Standard deviation – a measure of the distance of each value in a dataset from the mean if that dataset
Standard error – the standard deviation of the mean of multiple samples, say we tale 10 samples, get a mean of each, we have 10 means. they have different values. we take the mean of these means and calculate the standard deviation of that mean
Static State – your BAU business metrics – the natural level of business you generate without advertising spend input
Sub Group – any target audience that is smaller than the full adult population e.g. Men aged 16-34
TGI – Target Group Index – a major consumer behaviour survey
Time series data – a sample of data that is spread over time, such as sales data by week over three years
Vector – a vector is a string of data that has magnitude and direction e.g. 1,3,5,7,9,11,13 can be called a vector
x variables – our explanatory variables (x1, x2, x3 etc) – these are the variables that we are testing for an effect on our dependent or Y variable
y variable – the variable we want to explain with our explanatory variables (x1, x2, x3 etc)
YouGov – a major consumer survey of behaviour and attitudes, similar to TGI