email:
mob:
+61 418 359 711

OriginsInfocus

ISSUE 3: May 2012

2012 Data Day

May sees one of ADMA’s key annual events take to the road with a Data Day in Brisbane (10th May), Sydney (15th May) and Melbourne (17th May).

With three international presentations and twelve further sessions arranged in two streams, there will be something for everybody.  Presenters include representatives from Telstra, NAB, TRUenergy, Suncorp and Deloitte Analytics.

This is one of the few events to focus on data as the essential resource to drive creative communications and underpin marketing strategy.

Further information and registration details here

Ways of Using Origins 

This regular newsletter item for users and potential users puts a spotlight on different ways of using Origins to help you make the most of your insight into the cultural characteristics of customers and to maximise value from your investment.  In this edition, we describe how, once you have coded your customers, you can make selections and target relevant offers and messages to the right targets for cross-sell and up-sell.

There are three considerations in defining the selection criteria for a campaign target group:

1.    Level of aggregation
2.    Use of family name
3.    Use of the confidence score

Level of Aggregation

The Origins software currently assigns one of 253 codes to a customer record.  In the majority of contexts, this level of detail may be too great for campaign selections.  Exceptions include cases where organisations are seeking to communicate with specific communities such as Vietnamese, Sikh Indians, Koreans or Armenians.

In most cases, some degree of aggregation will be sufficient.  It may be sufficient to use the standard aggregations recommended by OriginsInfo, or an aggregation that has been designed for a particular organisational or campaign context.  Such an aggregation may group together, for example, the 24 Indian Codes into a single “India” code, or the two Irish codes into “Irish”, or the eight codes for East Asian communities into “East Asian”.

Use of Family Name

In general, the family name is more diagnostic of a person’s likely cultural background and, as discussed in the previous edition, the Origins software places more emphasis on the family name than the personal name.  So, if the business and campaign context suggests some latitude in the targeting, then it may be sufficient just to use the Origins code for the family name to select the target group.

For example, if the campaign is aimed at people with any apparent affinity with Italian culture, then it may be sufficient to make selections based using just the family names that are coded to one of the seven Italian codes.

Use of Confidence Score

The Confidence Score may be used to screen out those prospects where there is an increased risk of inconsistency with campaign objectives or content.

High confidence scores are produced when there is a similarity between the allocations for both the first and family names.  Low confidence scores are produced when there is a conflict between the allocations for first and family names. 

This may occur for a variety of reasons; it may be indicate second or third generation migrant status, western-influenced Asian migrants, or other groups who for a variety of reasons have adopted names that vary from their original cultural background

The Confidence Score cut-off should be set at a level that varies according to the nature of the campaign.  As a guide, a threshold level should be set somewhere between 0.5 and 0.8, although this should be somewhat higher if the proposed campaign is designed for a very specific cultural segment – eg people who have recently migrated from India, or those people of Indian background who seem to retain strongest links with their cultural background.

Follow the Hints and Tips section for further information about the Confidence Score and its interpretation.

Of course, whichever of the three methods is used to define the selection criteria, it may be necessary to adjust the rules to meet other operational considerations, such as budget or minimum required volumes.

For further guidance, please feel free to .

Hints & Tips for Users – Making the Most of Origins  

What do the Confidence Score and the Confidence Category mean?  How are they derived and how do I interpret them?

In the last edition of OriginsInfocus we introduced the concept of the Confidence Score in the context of a discussion about Accuracy and Coverage.

As well as the ability to append an overall Origins code to a name, the Origins software identifies the Origins code for each name element.  Two outputs generated in the Origins processing are the Confidence Category and the Confidence Score.  

Confidence Category

The Confidence Category indicates whether the first name and the family name belong to the same, or to broadly similar Origins codes.  There are four possible values of the Confidence Category – ORIGIN, SUBGROUP, GROUP, and VOID.

If the two names are assigned to the same three-character Origins CEL code then the value ‘ORIGIN’ is assigned.  If they do not have the same Origins CEL but share the first two characters of the three-character CEL code, the value ‘SUBGROUP’ is assigned.  To illustrate the idea, if the first name is Russian and the family name is Ukrainian, the first two characters of the CEL code are “DP”.

Where the Origins CEL of the first name and family names only match on the first character of the CEL code, then the value ‘GROUP’ is assigned.  An example is where both names share the first character of the CEL code “I” (Islamic) as the first character.

If the personal and family names are assigned to quite different Origins groups then the category ‘VOID’ is assigned.  The only exception to this rule is with Origins Groups A (Anglo-Saxon) and B (Celtic).  These are treated as synonymous.  So, for example, a person with an English first name and a Scottish family name would generally be assigned to a ‘SUBGROUP’ category.

The Confidence Category plays a role in the calculation of the Confidence Score.  

Confidence Score

There is no definitive or simple way to decode or scale the confidence score.  This is because of the large number of names in the system and the very large number of permutations of name combinations.  Each score is a function of the individual confidence scores for each first name and each family name.

The higher the score, the greater the confidence can be placed in the allocation of the overall name combination to the particular CEL.  In general, values greater than 0.5 are good for all purposes.  The highest score is around 15 and the lowest scores are below 0.1.  Analysts would do well to report on the distribution of confidence scores within a particular group of customers to understand the overall distribution and to assess the Origins coding characteristics of records belonging to different confidence score bands.  This may assist the business to optimise performance in any campaign deployment.

For further guidance please feel free to .

 

 

 

 

 

 

 

 

 


CEH logo










CEH logo













































 

 

 

 






 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Contact OriginsInfo

Australia and New Zealand

OriginsInfo
email:
mob:
+61 418 359 711

Europe and USA

Experian Business Strategies
Peter Cummings
email: