31
‘Improving the ethnic classification of patient registers’ Centre for Advanced Spatial Analysis, UCL 25 th May, 2005

‘Improving the ethnic classification of patient registers’ Centre for Advanced Spatial Analysis, UCL 25 th May, 2005

Embed Size (px)

Citation preview

Page 1: ‘Improving the ethnic classification of patient registers’ Centre for Advanced Spatial Analysis, UCL 25 th May, 2005

‘Improving the ethnic classification of patient registers’

Centre for Advanced Spatial Analysis, UCL

25th May, 2005

Page 2: ‘Improving the ethnic classification of patient registers’ Centre for Advanced Spatial Analysis, UCL 25 th May, 2005

Objectives of seminar

1. Promote awareness of existing tools for targetting health communications at ‘ethnic’ groups

a. Individual patients on registersb. Surgeries and other contact pointsc. Local promotional activities

2. Discussion of ideas for collaborative use of individual patient registers to improve the quality of ‘ethnic’ coding

3. Forum for exchange of informationa. Methodsb. Ethics and other data protection issues

Page 3: ‘Improving the ethnic classification of patient registers’ Centre for Advanced Spatial Analysis, UCL 25 th May, 2005

Our conception of ‘Ethnicity’

Multi-variate classification based on a combination of :

Cultural Origin – eg religion, beliefs

Ethnicity – eg country of origin, diet

Language

Page 4: ‘Improving the ethnic classification of patient registers’ Centre for Advanced Spatial Analysis, UCL 25 th May, 2005

Background to this seminar

Page 5: ‘Improving the ethnic classification of patient registers’ Centre for Advanced Spatial Analysis, UCL 25 th May, 2005

Context : CASA (1)

• 1970s - Neighbourhood classifications used for prioritising public sector initiatives

• 1990s - Application of postcode classifications adopted by commercial companies

• 2002 – CASA becomes involved in the application of Mosaic in health, policing and education 2003

• CASA work with Dr Foster on Slough Diabetes project

Page 6: ‘Improving the ethnic classification of patient registers’ Centre for Advanced Spatial Analysis, UCL 25 th May, 2005

Context : CASA (2)

• 2004 – CASA sets up Knowledge Transfer Partnership (KTP) with Camden PCT to develop health applications of geodemographics

• 2004 – CASA wins ESRC grant for ‘quantitative analysis of names’

• 2005 – Camden PCT develops capability in the application of ‘names’ as well as Mosaic to targetting of public health campaigns

Page 7: ‘Improving the ethnic classification of patient registers’ Centre for Advanced Spatial Analysis, UCL 25 th May, 2005

Contact details

• CASA website• E/mail addresses

– Pablo Mateos – [email protected]– Richard Webber – [email protected]– Paul Longley –[email protected]

www.casa.ucl.ac.uk/geonom

Page 8: ‘Improving the ethnic classification of patient registers’ Centre for Advanced Spatial Analysis, UCL 25 th May, 2005

‘Quantitative Analysis of Names’

• ESRC funded project– Use of surname as an identifier of cultural

origin• Regional origins of English names• Regional distribution of Celtic names• Current locations of names ‘imported from abroad’

– Jewish– Continental European and Hispanic– Asian– African– Middle Eastern

Page 9: ‘Improving the ethnic classification of patient registers’ Centre for Advanced Spatial Analysis, UCL 25 th May, 2005

Identification of potential applications

• Academic / Social Scientific– Study of meaning of names– Studies of historic migration patterns– Social mobility of Celtic migrants to England

• Policy applications– Measurement of ‘social capital’– Differentiation of crude ‘South Asian’ definition– Targetting of public sector communications programmes– Auditing of equal opportunities in employment

Page 10: ‘Improving the ethnic classification of patient registers’ Centre for Advanced Spatial Analysis, UCL 25 th May, 2005

Key data files

• 40 million records Experian 1996 GB electoral roll– First name– Surname– Postal area code– Mosaic code

• 26 million records 1881 census

• Summary statistics on name frequencies by region from Anglophone diaspora– US, Canada, Australia, New Zealand, North and Southern

Ireland

Page 11: ‘Improving the ethnic classification of patient registers’ Centre for Advanced Spatial Analysis, UCL 25 th May, 2005

Geography of the name ‘Webber’

Page 12: ‘Improving the ethnic classification of patient registers’ Centre for Advanced Spatial Analysis, UCL 25 th May, 2005

% electors with occupational names

Page 13: ‘Improving the ethnic classification of patient registers’ Centre for Advanced Spatial Analysis, UCL 25 th May, 2005

% electors of Welsh surnames

Page 14: ‘Improving the ethnic classification of patient registers’ Centre for Advanced Spatial Analysis, UCL 25 th May, 2005

CEL assignment : Phase one

• Identify 25,000 surnames with > 100 occurrences in 1996

• Assign to hierarchy– English; general name type; detailed name

type– Celtic; country of origin; general type– Imported from abroad; region of origin;

country of origin

Page 15: ‘Improving the ethnic classification of patient registers’ Centre for Advanced Spatial Analysis, UCL 25 th May, 2005

Webber

• Level one : English ‘metonym’

• Level two : Metonym ending in ‘-er’

• Level three : Manufacturing occupation

Page 16: ‘Improving the ethnic classification of patient registers’ Centre for Advanced Spatial Analysis, UCL 25 th May, 2005

Zhang

• Level one : Imported from abroad

• Level two : East Asian

• Level three : Chinese

Page 17: ‘Improving the ethnic classification of patient registers’ Centre for Advanced Spatial Analysis, UCL 25 th May, 2005

Names > 100 occurrences Count

English 19,246

Celtic 3,396

Imported from abroad 2,987

Total 25,630

Page 18: ‘Improving the ethnic classification of patient registers’ Centre for Advanced Spatial Analysis, UCL 25 th May, 2005

Muslim and South Asian names (1462)

IMPORTED FROM ABROAD;MUSLIM;AFGHAN 19

IMPORTED FROM ABROAD;MUSLIM;BANGLADESHI 83

IMPORTED FROM ABROAD;MUSLIM;ERITREAN 3

IMPORTED FROM ABROAD;MUSLIM;LEBANESE 2

IMPORTED FROM ABROAD;MUSLIM;MIDDLE EASTERN 125

IMPORTED FROM ABROAD;MUSLIM;NORTH AFRICAN 33

IMPORTED FROM ABROAD;MUSLIM;PAKISTANI & IRANIAN 203

IMPORTED FROM ABROAD;MUSLIM;PERSONAL NAME 85

IMPORTED FROM ABROAD;MUSLIM;SOMALI 40

IMPORTED FROM ABROAD;MUSLIM;SUDANESE 1

IMPORTED FROM ABROAD;MUSLIM;TURKISH 88

IMPORTED FROM ABROAD;OTHER SOUTH ASIAN;HINDI 254

IMPORTED FROM ABROAD;OTHER SOUTH ASIAN;NEPALESE 2

IMPORTED FROM ABROAD;OTHER SOUTH ASIAN;NORTH INDIAN 184

IMPORTED FROM ABROAD;OTHER SOUTH ASIAN;SIKH 290

IMPORTED FROM ABROAD;OTHER SOUTH ASIAN;SOUTH INDIAN & SRI LANKAN 50

Page 19: ‘Improving the ethnic classification of patient registers’ Centre for Advanced Spatial Analysis, UCL 25 th May, 2005

Phase one assignment method(25,000 names with > 100 occurrences)

1. General knowledge

2. Identification of top postal area and level of concentration in it

3. Identification of top Mosaic type and level of concentration in it

4. Identification of concentration in 1881

5. Frequencies in other Anglophone countries

Page 20: ‘Improving the ethnic classification of patient registers’ Centre for Advanced Spatial Analysis, UCL 25 th May, 2005
Page 21: ‘Improving the ethnic classification of patient registers’ Centre for Advanced Spatial Analysis, UCL 25 th May, 2005

C20 : Suburban Comfort / Asian Enterprise

Wakemans Hill, Colindale, NW9 0UU

The Warren, Heston, TW5 0JW Headcorn Road, Thornton Heath, CR7 6JS

Himley Crescent, Wolverhampton, WV4 5DA

Page 22: ‘Improving the ethnic classification of patient registers’ Centre for Advanced Spatial Analysis, UCL 25 th May, 2005

D26 : Ties of Community / South Asian Industry

Aberdeen Place, Bradford, BD7 2HG

Ivy Road, Luton, LU1 1DL Edmundson Road, Blackburn, BB2 1HL

Osborn Road, Sparkbrook, Birmingham, B11 1TT

Page 23: ‘Improving the ethnic classification of patient registers’ Centre for Advanced Spatial Analysis, UCL 25 th May, 2005

Status and Asian names

Asian Enterprise

South Asian Industry

Mayat 145 9533

Lorgat 971 8840

Gorasia 7622 275

Page 24: ‘Improving the ethnic classification of patient registers’ Centre for Advanced Spatial Analysis, UCL 25 th May, 2005

D27 : Ties of Community / Settled Minorities

Algernon Road, Lewisham, SE13 7AP

Essex Road, Leyton, E10 6BT Guildersfield Road, Streatham, SW16 5LS

Melbourne Road, Walthamstow, E17 6LR

Page 25: ‘Improving the ethnic classification of patient registers’ Centre for Advanced Spatial Analysis, UCL 25 th May, 2005

F36 : Welfare Borderline / Metro Multiculture

Broadwater Farm, Tottenham, N17 6HT

Hillcrest, Highgate, N6 4EX Kenninghall Road, Lower Clapton, E5 8DG

Samuel Street, Woolwich, SE18 5LJ

Page 26: ‘Improving the ethnic classification of patient registers’ Centre for Advanced Spatial Analysis, UCL 25 th May, 2005

Output

• Directory assigning a Cultural / Language / Ethnicity code to each name with more than 100 occurrences on the GB electoral roll

Page 27: ‘Improving the ethnic classification of patient registers’ Centre for Advanced Spatial Analysis, UCL 25 th May, 2005

Phase two assignment(all surnames > 5 occurrences)

• Rank first names by frequency

• Allocate names to CEL categories where possible

• Identify for each surname the proportion of associated first names in known CEL categories

Page 28: ‘Improving the ethnic classification of patient registers’ Centre for Advanced Spatial Analysis, UCL 25 th May, 2005
Page 29: ‘Improving the ethnic classification of patient registers’ Centre for Advanced Spatial Analysis, UCL 25 th May, 2005

Selected first names

Agapios Antigoni Sotiri Sotiris

Total occurrences

10 57 11 224

Surnames not British

6 34 7 143

Surnames Greek

3 19 5 75

Page 30: ‘Improving the ethnic classification of patient registers’ Centre for Advanced Spatial Analysis, UCL 25 th May, 2005

Output

• Database giving for 60,000 surnames ‘imported from abroad’– % electors by CEL of first name– Most common cell (three level hierarchy)

• Database giving for 60,000 first names ‘imported from abroad’ – (3.2m occurrences)– % electors by CEL of surname– Most common cell (three level hierarchy)

Page 31: ‘Improving the ethnic classification of patient registers’ Centre for Advanced Spatial Analysis, UCL 25 th May, 2005

Evaluation of solution• Seems to work well for all ethnic groups other than Caribbeans• CEL overlaps between surname and first name

– South Asians and Muslims – 80%– Africans, Turks, Cypriots, Chinese – 50%– Hispanics – 20%– Other Europeans – 8 – 15%– Jew – 4%– Irish, Scots, Welsh – 3%

• High overlap between certain CELs – within Muslim group– Spain, Portugal, Italy– Netherlands, Germany and Czech Republic

• Confusion among serial migrant groups– Hispanic migrants to India– Chinese migrants to West Indies