SciELO - Scientific Electronic Library Online

 
vol.33 issue57Capital gain, real wages and labor efficiency in the manufacturing sector in ColombiaHave been informationally efficient stock markets? author indexsubject indexarticles search
Home Pagealphabetic serial listing  

Services on Demand

Journal

Article

Indicators

Related links

  • On index processCited by Google
  • Have no similar articlesSimilars in SciELO
  • On index processSimilars in Google

Share


Apuntes del Cenes

Print version ISSN 0120-3053

Apuntes del Cenes vol.33 no.57 Tunja Jan./June 2014

 

Artículo de Investigación

Occupational segregation, selection effects and gender wage differences: evidence from urban Colombia

Jairo Guillermo Isaza Castro*

* Associated professor, Universidad de La Salle. Bogota, Colombia. Economist, Universidad Católica de Colombia, Bogotá. MA in Development Studies, International Institute of Social Studies, The Netherlands; PhD in Economics, University of Sussex, United Kingdom. My sincere acknowledgements to Prof. Barry Reilly for his valuable suggestions on an earlier draft of this paper. The author is also thankful for the comments received from an anonymous referee and from the participants at the Sussex DPhil Conference organised by the Department of Economics, University of Sussex (Brighton, UK) on November 2009 and the Third Alban Conference celebrated in Porto (Portugal) on June 2009. Usual disclaimers apply. Contact: jgisaza@gmail.com and jisaza@lasalle.edu.co

Fecha de recepción: 11 de octubre de 2013 Concepto de evaluación: 7 de febrero de 2014 Fecha de aprobación: 21 de marzo de 2014


Abstract

This paper assesses the effects of occupational segregation on the gender wage gap in urban Colombia between 1986 and 2000. The empirical methodology involves a twostep procedure whereby the occupational distributions of workers by gender are modelled using a multinomial logit model in the first stage. In the second stage, the multinomial logit estimates are used not only to derive a counterfactual occupational distribution of women in the absence of workplace discrimination but also to correct for selectivity bias in the wage equations for each occupational category using the procedure suggested by Lee (1983). Besides the explained and unexplained components in conventional decompositions of the gender wage gap, this methodology differentiates between the justified and unjustified effects of the gender allocation of workers across occupational categories. The results for urban Colombia indicate that controlling for selectivity bias at the occupational category level is found to be relevant in all years reviewed in this study. They also suggest that a changing composition of the female labour supply in terms of un observables (i.e., ability and motivation) is playing a role in the dramatic reduction of the observed wage gap.

Keywords: occupational segregation, gender wage gap, multinomial logit, selection bias, Colombia.

JEL: J16, J71, C24.


Resumen

Este documento analiza los efectos de la segregación ocupacional en la brecha salarial por género en Colombia urbana entre 1986 y 2000. La metodología empírica involucra un procedimiento en dos etapas por el cual las distribuciones ocupacionales de trabajadores por género son modeladas a partir de un modelo logit multinomial en una primera etapa. En una segunda etapa, los estimativos del modelo logit multinomial se usan no solo para derivar una distribución ocupacional contra factual de las mujeres en ausencia de discriminación en el lugar de trabajo, sino también para corregir por sesgo de selección en las ecuaciones de ingresos laborales por género para cada categoría ocupacional, a través del procedimiento sugerido por Lee (1983). Además de los componentes explicados y no explicados en descomposiciones salariales convencionales, esta metodología diferencia entre los efectos justificados y no justificados de la distribución de trabajadores por género a lo largo de categorías ocupacionales. Los resultados para Colombia urbana indican que controlar por sesgo de selección en el nivel de categoría ocupacional es relevante en todos los años revisados en el este estudio. Ellos también sugieren que los cambios en la composición de la oferta de trabajo femenina en términos de variables no observables (por ejemplo, motivación y habilidad) están jugando un rol fundamental en la reducción de la brecha de ingresos por género registrada para este período.

Palabras clave: segregación ocupacional, brecha de ingresos por género, logitmultinomial, sesgo de selección, Colombia.


INTRODUCTION

Most of the empirical literature on gender differences in the labour market has focused either exclusively on wage discrimination or occupational segregation. Empirical research linking both aspects of the differentiated situation of women in the labour market is still scarce. While much of the economic research has been motivated by the ‘taste for discrimination' approach proposed by Becker (1971), the segregation dimension has merited less attention within this framework. The relative scarcity of applied economic research on the relationship between occupational segregation and gender/ ethnic wage discrimination may be explained by the fact that Becker's original model of discrimination does not explicitly incorporate the segregation dimension. In this sense, Baldwin et al. (2001) propose an extension to this approach by including a hierarchical dimension in which men dislike to work for women even in the case they do not object to work along side women. As a result, their model not only predicts that women's participation is exponentially decreasing with respect to job hierarchy but also that women's wage disadvantage in managerial positions is, at least, partially explained by a compensation mechanism of men's dislike for female supervision. Although we do not attempt an empirical test of all propositions derived from Baldwin et al.'s (2001) model, it is still relevant to highlight that the connection between gender wage discrimination and occupational segregation is theoretically grounded.

Since the seminal contributions of Oaxaca (1973) and Blinder (1973), a number of decomposition techniques of the gender wage gap have been proposed to distinguish between the effects of explained differences in human capital and other characteristics, on the one hand, and the effects of unexplained differences in returns to those characteristics or discrimination, on the other. Blinder (1973), in particular, suggested the use of dummy variables in the gender wage equations to control for the effects of occupations. This dummy variable approach implies that the gender distribution of jobs is justified or randomly distributed. However, it has been argued that if the gender distribution of occupations is subject to some sort of discrimination, as implied by the occupational segregation literature, the dummy variable approach is inadequate (MengandMiller, 1995, Miller, 1987, Reilly, 1991). Therefore, as occupational attainment is affected by discrimination, gender wage differences may be divided at least into two broad sources: first, within-occupations wage differences which are related to the explained and unexplained components mentioned above and, second, between-occupations wage differences due to job discrimination or occupational segregation. As in conventional wage decompositions, some part of the wage differential due to occupational segregation could be justified but some part may remain unexplained.

To the best of our knowledge, Brown et al. (1980) was the first to formulate a method to explicitly incorporate the effects of occupational segregation into the analysis of the gender wage gap. Using a multinomial logit, Brown et al. (1980) modelled the male occupational distribution to produce a counterfactual female occupational distribution based on the set of female average characteristics and the estimated coefficients from the male subsample. Thus, besides the explained and unexplained components, the gender wage gap is further disaggregated into a portion due to explained gender differences in the allocation of workers and a portion due to occupational segregation. Other studies (Liu et al., 2004, Neuman & Silber, 1996) have implemented a similar multinomial logit approach to decompose wage differences between ethnic groups while Miller (1987) relied on an ordered probit to model the gender distribution of occupations.

As suggested by Heckman (1979), the estimation of earning functions by gender poses a potential problem as they are drawn from a non-random sample of individuals who select themselves into the labour force. This creates well-known problems for gender wage gap decompositions given the fact that female labour force participation is markedly lower than male participation. But as suggested by the occupational segregation literature, access to some occupations may also be highly differentiated by gender (i.e., Anker, 1997, Borghans & Groot, 1999, Grazier & Sloane, 2007, Hakim, 1992). This suggests another important source of selectivity bias which has not been accounted for in most of the empirical literature about the gender wage gaps and occupational segregation.

The aim of this paper is to contribute to this literature by implementing a methodology proposed by Reilly (1991) which simultaneously controls for the effects of occupational segregation and corrects for selection into different occupations using Lee's (1983) multinomial logit based procedure. With an empirical application to urban Colombia, we find that controlling for selectivity bias at the occupational category level is found to be relevant in all years reviewed in this study. The results also suggest that a changing composition of the female labour supply in terms of un observables (i.e., ability and motivation) is playing a significant role in the dramatic reduction of the wage gap in this country. The rest of this paper is organised as follows. The next section formulates the empirical model and the third explains the data. The fourth provides some background of the country while the fifth presents the results and discusses the findings. Finally, the fifth section concludes.

METHODOLOGY

Brown et al. (1980) formulated an empirical framework in which human capital and productivity differences explain simultaneously the variation in wages and occupational attainment of men and women. Using a multinomial logit to produce a counterfactual of the female distribution of jobs using the coefficients for the male subsample, this approach provide a decomposition of the gender wage gap not only into differences in coefficients and average characteristics between men and women, but also into differences from explained and unexplained components of the gender occupational distribution. Formally, this approach may be formulated as follows. Let the male and female log wage equations be

where X is a vector of characteristics, β is a set of coefficients to be estimated acrossjoccupations for the male (= m) and female (= ƒ) subsamples and ε is an error term. Let Pm and Pƒ be the sample proportions for each gender. The gender wage gap for the overall labour force with k occupational categories may be expressed as

After rearranging terms and having P *j as the counterfactual distribution of female employment in the absence of segregation, the gender wage gap may be decomposed into four terms:

where the fist term on the right-hand side represents the unexplained component of the gender wage gap due to differences in returns from observable characteristics, the second is the component due to the level differences in those characteristics, the third is the component due to the explained allocation of workers and the fourth represents the portion of the wage gap due to occupational segregation1. According to Liu et al. (2004), the first two components may be interpreted as the within occupation wage differential while the last two denote the between occupational wage differential.

It has been established that ignoring selection effects when present may yield a bias in the coefficient estimates. In the case of the labour market, this possibility is present in the endogeneity of a workers' occupational choice with respect to the income determination process within specific occupations (Gyourko and Tracy, 1988). While the Heckman (1979) procedure may be valid to correct selection bias in terms of the binary case of participating or not participating in the labour market, this approach has evident limitations when dealing with segmented labour markets and/or where the decision to participate is taken across multiple choices such as economic activities and/ or occupational categories. Using Lee's (1983) generalised method which allows a correction for selectivity bias across a set of mutually exclusive groups, Reilly (1991) refined the approach proposed by Brown et al. (1980) by using the same multinomial log it model results (also for women) in order to create a selection term as an additional regressor into each of the occupational wage equations2. Assuming that the probability P of attachment of the i individual to the j occupational choice is determined by the vector Z, the multinomial log it model of occupational choice may be represented as

Where the base outcomejis set equal to 0 as required by the Theil normalization. Estimates from (5) are used not only to construct the counterfactual occupational distribution P in (4) (with the coefficients drawn from the male subsample to simulate the occupational distribution for the female subsampleas if they were treated as their male counterparts) but also to correct the occupational wage equations for selection bias. Thus, the wage equations in (1) and (2) corrected for selectivity bias may be rewritten as

and

where φ and J denote the standard normal density function and the inverse of the cumulative density function, respectively, of predicted probabilities given by Zjm Yjm and Zfj Y fj ; sj parameters capture the strength of the correlation between the un observables determining Wj and those determining the jth occupational attachment modelled by the MNL. Lee’s (1983) approach is analogous toHeckman’s (1979) two-step procedure in the sense that themaximum likelihood estimates from (5) for each gender subsample are inserted in (6) and (7) in order to construct a selection term. Thus, (6) and (7) may be conveniently rewritten as

and

where, in the case of the male subsample, λ =-φ (J(zmjγmj)) F(zfjγfj) and ∈mj is an error term3. With a large sample, heteroscedasticity consistent standard errors for (8) and (9) can be obtained using the White/Huber sandwich estimator (White, 1980) though, bootstrapping is also possible. We preferred White's (1980) procedure as this provides a reasonably accurate estimate of the variance/covariance matrix in this context.

Data

The empirical application of the model described above relies on household survey microdata from the seven largest metropolitan areas of urban Colombia which represent around 36 per cent of the country's population and nearly onehalf of its urban population. Household surveys in urban Colombia are gathered by the Government on a quarterly basis since the mid-1980s and our dataset comprises five repeated cross sections for all quarters of 1986, 1990, 1995, 2000 and 20044. On average, each of these years provides microdata on 160,000 individuals between 18 and 65 years old and 66,000 workers in full time occupations (which, by Colombian standards is 40 or more hours per week).The microdata give information on labour earnings, number of weekly worked hours, industry, region, demographic variables (i.e., age, educational attainment, marital status), and the socioeconomic strata of the incumbent household which is based on housing and neighbourhood characteristics. Furthermore, the surveys include information about occupations using a consistent classification of 82 categories over the entire period which, at the two-digit level, is identical to the International Standard Classification of Occupations ISCO-68. For the purposes of this research, we regrouped the 82 original categories into five broad occupations for computational convenience in order to guarantee a large enough sample size for each (see Table 1) plus one additional category which comprises both, individuals out of the labour force and part-time workers (less than 40 hours per week). We attempted also an alternative grouping using the ISCO-68 at one digit with nine categories but the small number of female observations in some cells made this approach unfeasible for accurate statistical inference.

EMPIRICAL RESULTS

Determinants of occupational attainment

As customary in similar empirical applications, the occupational attainment model described above is reduced form in nature and assumes that the occupation held by an individual is the result of supply and demand factors (Brown et al., 1980, Liu et al., 2004, Reilly, 1991). On the supply side, the decision of the individual to engage in a certain occupation is determined by a utility function in terms of expected income, personal tastes and family structure. Although most of those variables are not observed in the Colombian microdata, our model is rich invariables related to the household structure: marital status, a dummy for household heads, household unemployment rate, number of both infants and children in the household; and a dummy for households with a domestic servant5. Personal tastes and preferences are not directly observable but we attempt to proxy them with neighbourhood characteristics based on average log labour income and average education for the corresponding sampling segment6. We include also dummies for cities and neighbourhood socioeconomic strata which reveal housing conditions and access to urban infrastructure. On the demand side, the willingness of an employer to hire a worker in a given position is assumed to be dictated by productivity signals as well as firmand jobspecific tastes for discrimination. Our multinomial logit specification includes as productivityindicators a conventionalproxy of labour experience (i.e., age – years of schooling – 6) with a quadratic term and, dummies for primary, secondary and university education. Firm and job-specific tastes for discrimination are, thus, relegated to the residual together with other un observables.

The maximum likelihood estimates of the multinomial logit described in (5) are presented in Tables 2a and 2b. In order to conserve space, we present only the results for 20047. It is very well known that coefficient interpretation is not entirely transparent in the logit specification. For this reason, we prefer to present the marginal and impact effects computed at the average of sample characteristics rather than the crude multinomial logit coefficients. None the less, their interpretation should be taken with some caution asthe average characteristics may not be entirely appropriate at the extremes of the probability distribution of occupational outcomes. Beyond these caveats, some inferences can be safely made on the basis of these results. For instance, our proxy of labour market experience for both, men and women is statistically significant beyond conventional levels in all occupations. The significance of the quadratic term of labour market experience suggests an inverted Ushaped relationship for all categories and appears to be inversely correlated with the skill level. While the maximum probability of attachment to the professional & managerial category is located at 18 years of experience amongst men and 16 years amongst women, the peaks for those in semiskilled and unskilled occupations are, respectively, at 26 years and 24 years for the male and female subsamples. On the other hand, the estimated effects for the dummies by educational level suggestin general that the higher the educational level of the individual, the more likely he or she is to be found in occupations where academic credentials are relevant. For example, possessing a university degree increases the probability of being in professional occupations by 34.7 percentage points amongst the male subsample compared to 38.7 percentage points amongst the female subsample.

Variables pertaining to the family structure appear also to play an important role in the choice of occupation with a clear differentiation in terms of gender. Among such variables, both the number of infants and children are found to be negatively correlated with the probability of female participation in all categories. Conversely, the number of infantsis found to increase the probability of being in all occupations amongst menwhile the number of children reduces the probability of females being working in professionals, clerks and sales jobs. Marital status is also highly differentiated in terms of gender in as much as being single raises the probability of attachment of women to most occupations (except professionals & managers) while its effect is not statistically different from zero for men in all categories (except in semi-skilled occupations). In summary, the extent of differentiation in marginal/impact effects from variables related to the household structure somehow resembles asymmetric household responsibilities and gender roles which, in general, tend to reinforce patterns of occupational segregation8.

Lastly, neighbourhood (or sampling segment) variables tend to reflect both, the tastes and the effects of the household social networks on the occupational outcome. Our results for 2004 suggests that women and men living in neighbourhoods where higher educational levels prevail are more likely to be attached to occupations where educational credentials are more relevant: professionals & managers and, clerks & sales personnel. Likewise, men and women living in neighbourhoods with higher educational levels are less likely to be found in semi-skilled and unskilled occupations.

Table 3 illustrates sample proportions of workers across the five occupational distributions defined for this study. It also displays the counterfactual distributions of female workers,, which arecomputed by applying the multinomial logit coefficients for the male subsample to the female subsample. Two categories, clerks & sales personnel and, service workers, represent more than 70 per cent of female employment in all years reviewed here. As the categorisation of occupations is quite broad, we observe also that the proportion of workers in professional & managerial positions in terms of gender is remarkably egalitarian and women have been increasing their share since 1986. Women appear underrepresented in the blue-collar semi-skilled and unskilled positions in all years. The counterfactual estimates indicate that women are over represented in service positions where about two thirds of the female proportion estimates indicate that the proportion of of workers is ‘unjustified'. Conversely, the female labour force allocated to this women are clearly underrepresented in category should be, on average, more than unskilled jobs as the counterfactual 7.5 times higher.

Our counterfactual modelling strategy assumes that the allocation of workers is primarily a function of personal, household and neighbourhood characteristics which are evaluated with the coefficients for the male subsample. Given that the distribution of those characteristics may differ in terms of gender, it is assumed that some part of discrepancies in the gender distribution of workers across occupational categories is justified. For this reason, Table 4 presents factual and counterfactual estimates of dissimilarity indices to see more precisely the extent to which the gender distribution of jobs would be altered in the absence of discrimination in the access to version of the Duncan and Duncan index occupations. The first one is the widely proposed by Karmel and Mac Lachlan known measure proposed by Duncanand (1988) which is interpreted as the Duncan (1955) which is interpreted as proportion of female workers who would the percentage of workers of either sex have to exchange jobs with male workers who have to move from occupations in if an even gender distribution of jobs is to order to achieve an even distribution of be achieved without altering the overall jobs by gender. The second is a modified employment structure.

Estimates for the Duncan and Duncan index reveal little changes between 1986 and 2000, while those for the Karmel and Mac Lachlan suggest a slight increase in the extent of segregation. While the first one fell by just 2.8 per cent between 1986 and 2004 the second grew by 2.9 per cent over this same period. Furthermore, if women were treated as men in the occupational selection or sorting process, as assumed in the counterfactual scenario, both indices in 2004 would be reduced by more than 97 per cent. This gives an indication that most of the occupational segregation based on the five occupational categories implemented in our multinomial logit specification above is unjustified. It should be noted, however, that a set of different estimates using a classification of 82 occupations indicates that occupational segregation has experienced a substantial reduction during this period, particularly amongst labour force with university education (Isaza & Reilly, 2009). The magnitude of standard errors computed using a bootstrap technique is remarkably small, with zeros in the first three decimal positions in most cases either in the case of the full 82 occupations or in the aggregated version of five categories presented here. For this reason, we judged useless to report them on the table.

Wage equations and selection effects

Now our attention turns to the interpretation of wage equations corrected for selectivity bias. Our log wage specification is quite austere in terms of human capital and productivity characteristics as the Colombian data do not include in all waves explicit information about labour market experience, and characteristics related to specific types of education and abilities possessed by the individual. Hence, we have to rely only on those variables which are available in all survey waves between 1986 and 2000: age, its quadratic form, and the number of years of formal schooling in the form of a disjointed spline function with one knot at 11 years of formal schooling. The inclusion of education in the spline form aims at a consistent characterisation of the labour market across different occupational categories over the 19 year period analysed in this study. In the case of Colombia, the completion of 11 years of education constitutes a landmark in the educational system of this country as this enables access to professional and most vocational training programmes. Furthermore, access to compulsory military service for males is shortened for those with complete secondary education(i.e., 11 years of formal schooling). Complete secondary education, with a certificate of previous compliance of military service for males, are valuable credentials to access most of jobs in the formal sector. We include also seasonal dummies and controls for cities, as well as two dichotomous variables depicting some job characteristics: one for government employees, and another for own-account workers. As the literature on wage equations by gender is quite abundant, even in the Colombian case, our interpretation emphasises on the role of selection effects9.

A distinctive feature of this empirical application is the inclusion of specific selection terms in the wage equations for each occupational category using the estimates from multinomial logit equations of occupational attainment by gender. Following the interpretation suggested in Gyourko and Tracy (1988) and Reilly (1991), the selection effect can be computed as the product of the mean value for the selection variable (φ/F= λ) multiplied by the coefficient value (σ j ρ j = θ) (see expressions 6 and 7, above). According to this method, selectivity effects are found to be statistically significant for both, womenandmen inmost years and across the majority of occupations (see Tables 5a and 5b). In the male subsample, the strongest effects are found amongst clerks & sales workers followed by those in the professional & managerial category. In the former, the results for 2004 indicate that men earn, on average and ceteris paribus, 95 per cent higher wages than an individual drawn at random from the male subsample while in the latter this differential is more than 89 per cent (see Table 5a). In the female subsample, selection effects are the strongest amongst professionals & managers whose hourly wages in 2004 are 59 per cent higher than those of an average woman randomly drawn from the female subsample (see Table 5b).

A possible rationale for the results outlined above is that individuals who are selected into occupations where academic credentials are highly relevant (i.e., professionals and clerks) possess some productivity characteristics such as motivation and ability which are un observed by the econometrician but highly rewarded in the labour market. If the set of unobservables is highl y correlated with both, labour productivity and selection into some particular occupations, the magnitude of selection effects into these categories should be substantially larger compared to other occupations. This seems to be the case of men and women in professional & managerial occupations (and, to a lesser extent, men in clerks and sales personnel) whose selection coefficients are substantially larger than the rest of the labour force. To illustrate the opposite case, workers in the unskilled category report the lowest selection effects with -14.8 and 64.4 per cent in 2004 for the male and female subsamples respectively. Both selection effects are statistically different from zero at the one percent level. Selection effects in this category, as opposed to the case of the most qualified, imply that unskilled workers are endowed with a set of unobservables such as motivation and ability well below the average of their gender subsamples and consequently secure lower wages. This also suggests the possibility that differences inunobservables are more clearly differentiated when the labour force is divided into several occupational categories compared to the dichotomous approach of labour participation which tends to overshadow selection effects, particularly among male workers.

Overall the trends in the selection effects between 1986 and 2004 suggest opposite patterns across gender groups, as they are increasing for the male subsample and decreasing for the female subsample. A plausible explanation for these results is the diverging patterns of labour force participation in terms of gender over these years, with an increasing proportion of women enrolling in the labour market which makes selection effects less differentiated between those inside and outside the labour force. A decreasing trend of male labour force participation may be performing an opposite effect in the male subsample. Interestingly, selection effects appear to be larger for the male subsample and statistically significant in most cases. This finding contrasts with some empirical applications in urban Colombia where the conventional probit approach based on Heckman (1979) suggests that selection effects are mainly relevant only for the female subsamples (Nunez et al., 2005, Núñez & Sánchez, 2002, Ribero, 2003). On this it should be stressedthat our multinomial logit approach is implicitly controlling for two different processes. The first one which relies on the individual decision to participate or not in the labour market and another which has to see with the sorting of individuals across different occupations once the decision to participate has been taken. While the former is a self-selection decision of the individual based on personal circumstances, the second is a selection process driven mainly by employers' preferences based on their valuations of different individual characteristics (i.e., educational level and previous labour market experience).

Decomposition of gender wage gaps by occupational categories

In this section we present the results of the gender wage gap decomposition by occupational categories. As selection effects are statistically significant in most cases, we extend the decomposition by including the effects of the selection terms In this section we present the results of in the decomposition. Thus, the log wage the gender wage gap decomposition by gap for thejcategory may be expressed as

where the first term on the right hand side represents the portion of the wage gap due endowments and the third comprises the portion due to differences in treatments, the second captures the portion due to differences in endowments and the third comprises the portion due to differences in selection effects. The results of expression (10) for all years are reported in Table 6.

The magnitude of the gender wage gap contracted in all occupational categories between 1986 and 2004. The largest reductionwas recorded amongst clerks and sales workers, where the differential fell by three quarters from 17.6 per cent in 1986 to 4.6 per cent in 2004, the lowest of all five groupings. In the case of professionals & managers, the log hourly wage gap fell by almost two thirds over the same period while it decreased by more than one half amongst unskilled workers. The smallest reduction is found for service workers, traditionally the occupational category with the largest proportion of women where, incidentally, the differential in 2004 was the highest of all occupational groups, with 25.5 per cent. It should be noted that those differentials are statistically significant in all cases, at least, at the five per cent level.

Regarding the decomposition methodology outlined in equation (10), differences in treatment (or what may be termed as differences in returns to observable characteristics which are presumably due to discrimination) explain most of gender wage gap for all occupational categories (except amongst service workers) over most of the five years included in this study. This component has also increased its magnitude between 1986 and 2004 in two occupational categories, clerks & sales and, semi-skilled workers while it got smaller for professionals & managers and, unskilled workers. It has been negative amongst service workers in most of the years which suggests that women's observable characteristics are better rewarded compared to their male counterparts in this particular category. This may explain in part why this is the occupational category with the highest proportion of women in the labour market of urban Colombia.

Differences in treatments are partially offset by selection effects which are found to be negative, thus reducing the size of the wage gapin most years. In fact, differences in selection are found to reduce the wage gap in all occupational categories and years, except amongst service workers from 1995 onwards and unskilled workers in 2000. Implicitly, the inclusion of selection effects in the decomposition outlined in (10) assumes that the level of motivation, ability and other unobservables is homogeneous across the gender subsamples but their remuneration is gender differentiated (Gyourko& Tracy, 1988). This provides some evidence that unobserved variables consigned to the selection term might be playing an important role in reducing gender wage differences in urban Colombia.

The portion of the wage gap due to differences in endowments, which include human capital and other observable characteristics, experienced a net reduction in all occupational categories between 1986and 2004. This suggest that another force behind the reduction of the gender wage gap in urban Colombia over these years is the rapid educational improvement of women relative to men which has been documented in the literature for this country (Isaza, 2002; Acosta & Isaza, 2007). This reduction is particularly important for workers in clerks & sales positions and, to a lesser extent, in semiskilled jobs where the endowment component is negative for most years.

The effects of segregation on the gender wage gap

As previously noted, the gender hourly wage gap in urban Colombia presented a dramatic reduction across all occupational categories over the years examined in this study. Overall, it fell by more than two thirds from 27.1 per cent in 1986 to 9.8 per in 2004.This is a sizeable reduction by international standards if we consider, for instance, that in the United States the gender annual wage gap contracted by just one third over the same years, from 35.0 per cent to 23.4 per cent (National Committee on Pay and Equity, 2008).

Table 7 presents the results of the decomposition outlined in equation (4) for all five cross sections analysed in this study. Thefirst three componentswere already discussed in the preceding section, but we include two additional terms: one reflecting the explained distribution of jobs by gender and, another representing the unexplained distribution of jobs due to occupational segregation. Therefore, the inclusion of the selection component implies that expression (4) could be reformulated as

where the third element on the right hand side depicts the portion of the wage gap explained by differences in selection.

As anticipated in the analysis by occupations presented above, the treatments component dominates most of the evolution of the gender wage gap over all cross sections examined here. The decomposition results for 2004 indicate that the gender wage gap solely explained by differences in treatment is roughly 29 per cent, which is higher than the magnitude of the same component in 1986, around 23 per cent. This indicates that most of the gender wage gap in urban Colombia concedes a large unexplained component, as far as it can be inferred from the variables included in the model, and probably suggests that the extent of discrimination is still quite pervasive.

The effects of the unequal returns to human capital and other observable characteristics is counterbalanced by the effects of the selection component, which comes up negative in magnitude for all five cross sections and suggest that the rewards to unobservable characteristics are actually helping to reduce wage differences between men and women in urban Colombia. In fact, it could be argued that most of the reduction in the gender wage gap in this country between 1986 and 2004 is driven by selection effects, reaffirming our conclusion from the previous section that gender differences in returns to unobservable characteristics such as ability and motivation are actually playing a central role in the reduction of wage differences by gender in this country. If we set the selection effects to zero (see Gyourko and Tracy, 1988), the story would be less promising: the gender wage offer gap would have changed from 29.9 per cent in 1986 to 28.4 per cent in 2004, a rather marginal reduction over 19 years. Ignoring selection effects implies that the comparison is performed using wage offers and implies that both, the returns to, and the level of unobservables are the same across gender groups so and in(4* cancel each other. To reconcile this line of analysis with what the statistician gets from the actual labour market data as the mean wage gender gap, we need to emphasize that the wage offer estimates control for selection effects which are then set to zero.This differential provides an estimate of the expected wage gap in the labour market for a couple of men and women with average characteristics randomly drawn from the sample. This is a less intuitive approach in as much as the traditional comparison of the conventional wage offer gap is replaced by a less appealing measure of wage offers which are not readily observable in the data.

However, the comparison of both approaches provides some light in the understanding of the evolution of gender wage differences in urban Colombia and the paramount role of selection effects. If the evolution of hourly wage offers indicates almost no change in the magnitude of the wage offer gap but the dynamic of observed mean hourly wages points to a dramatic reduction, selection effects are actually playing a key role in this story, perhaps, by selecting the best individuals in terms of unobservables. The female labour supply in urban Colombia is improving its average productivity over time through the accumulation of unobservable characteristics that enhances productivity and so, it manages to reduce the magnitude of the observed hourly wage gap by nearly two thirds over 19 years, from 1986 and 2004. Although conditions in the labour market have not improved as far as differences in treatment are concerned, the supply response has been sufficiently strong to compensate and actually reduce the extent of wage inequality by gender.

As in expression (4*), wage differences can also be due to differentialsin endowments which reflect the effects of differences in human capital and other observable characteristics. This component has decreased its importance on the magnitude of the wage gap over the years covered in this study. This is, at least in part, the result of a better human capital improvement of the female labour force with respect to men commented on the literature of this country (Isaza, 2002, 2013; Isaza & Reilly, 2009). Therefore, the contribution of the endowment component to the overall gender wage gap fell from just above one fifth to less than one tenth between 1986 and 2004.

Regarding the unequal distribution of jobs by gender, our decomposition results indicate that the explained allocation of workers component is not statistically different from zero along all years examined here. In contrast, the unexplained component originated as a result of occupational segregationis increasingly contributing to the reduction of the overall gender wage gap in urban Colombia. In other words, the unjustified portion of the dissimilaral location of workers across the five occupational categories defined in this study (this is, the pure segregation component) has actually helped to reduce the wage disadvantage against women in the labour market of urban Colombia, in particular, in 2000 and 2004 when this helped by itself to reduce the hourly log wage gap by one half. Perhaps, these results are driven by a highly aggregated classification of occupations based on only five broad groupings in which the effect of a changing balance in the gender composition of more narrowly defined occupations is obscured. It must be noted that our segregation indicators presented above reveal little change between 1986 and 2004, while the set of estimates (not presented here) based on 82 occupations defined by the Colombian classification reveal a substantial decrease in occupational segregation over the same years.

FINAL REMARKS

This paper has found that controlling for selectivity bias is justified for both genders as opposed to the conventional probit approach using the Heckman procedure in which selection effects are frequently found unwarranted for the male subsample. Our findings suggests that a finer characterization of the selection process into the labour market, either in terms of broad occupational categories or segments (i.e., formal/informal) may prove fruitful to better inform the nature of gender wage differences. We have adopted a multinomial logit approach as a departure point but other alternatives such as the ordered or multinomial probit may be more appropriate in cases where the categorization of different occupations is more clearly informed by hierarchical variables of productivity, prestige or social perceived status.

Our modelling strategy has provided interesting insights into the underlying forces behind a dramatic reduction of the overall wage gap in urban Colombia which, by international standards, is by itself a remarkable case. The decomposition results indicate that both, at the level of individual occupational categories and for the overall average for all categories, the unexplained component of the gender wage gap is the main force behind persistent levels of wage disadvantage against women in urban Colombia. This component is by itself in 2004 larger in absolute and relative terms compared to 1986. A decreasing portion of the wage gap is also explained by differences in endowments in favour of men, which are being reduced over time thanks to, most likely, better progress of women in terms of education compared to men.

Our decomposition results also suggest that selection effects are a key element in the history of the evolution of gender wage gaps in urban Colombia asthey appear to reduce the magnitude of labour income differences between women and men over all years. Although our modelling strategy renders direct comparison to other papers on this topic in Colombia, some authors have found selection effects as a key determinant of gender wage gaps in this country10. To some extent, this finding suggests that the composition of the female labour supply is changing in terms of unobservables in order to yield a net reduction in the overall wage gap. Probably, the pool of female workers entering the labour force is changing its composition as employers may be selecting the most endowed in terms of motivation, innate ability and other correlates of productivity not observed in our data. The effect of this process is not only to improve the quality of the female labour force as a whole but also to reduce the magnitude of the gender wage inequality. Probably as a matter for further research, it would be interesting to see with more complete datasets how the evolution of variables related to the level of motivation and intrinsic ability (such as scores of standardised tests) interact with labour market earnings and their influence on the magnitude of the gender wage gaps over time.

Quite surprisingly, our results indicate that the dissimilar allocation of workers actually helps to reduce the magnitude of the gender wage gap. We learned from our multinomial logit estimates that all segregation indices would be reduced by more than 90 per cent if women were treated as men in the selection process into the five occupational categories defined in this study. However, the decomposition results indicate that the wage gap would be higher in all years if women were distributed across the five occupational categories as implied by our counterfactual estimates. In other words, the way women and men are distributed across all five jobs groups actually helps to reduce, rather than increase the size of thewage gap over all years. This result is potentially affected by the highly aggregated nature of our occupational categorisation. We were not able to properly account for the dynamics of segregation levels in urban Colombia which are visible using a finer categorisation of occupations. Also, the effects of the limited access of women to high salaries positions largely dominated by men are also obscured by our gross categorisation of occupations. However, we believe that our application has its own merits in providing a better framework to correct for selectivity bias and simultaneously incorporate the effects of dissimilar job distributions by gender. There is clearly an opportunity for further research in the development of more efficient algorithms in the estimation of multiple choice models with a better capacity tomodel a large number of occupational outcomes.


NOTAS

1 This type of decomposition uses the male wage structure as the one prevailing in the absence of discrimination and we believe this is a reasonable approach in the context of this empirical application. However, the index number approach pursued here is subject to the conventional "index problem" as it is also possible to use the female wage structure or even the pooled sample wage structure. For a more extensive discussion of this topic, see: Appleton, Hoddinott & Krishnan (1999, p. 289-312).
2 For an excellent review of alternative methods to correct for selectivity bias using multinomial logit models see: Bourguignon, Fournier & Gurgand (2007, p. 174-205).
3 It should be noted that our strategy to correct for selectivity bias based on a multinomial logit approach allows controlling for two sources of bias, one from selection into the labour force and another from the choice of the occupational category to which the worker belongs. This implies that all selection effects commented here in this paper refer to an average worker drawn at random from the sample with a given a set of average characteristics.
4 It should be observed that more recent microdatais available for this country. We restrict our study, however, this time frame as the sampling design of the household survey changed substantially in subsequent years, particularly since 2006 onwards when both, a different sampling methodology and population projections were implemented in this country. Similar exercises with more recent data are indeed possible and we are currently working on that on a new paper.
5 The set of household characteristics included in the model plays an important role in our modelling strategy to control for selectivity bias as they represent the instruments not directly correlated with wages but highly correlated with the probability to participate in different occupational categories.
6 Sampling segments, as the final sampling unit, are compact or continuous sets of ten houses which are surveyed in full. For details on the sample design of the Colombian household survey, see: DANE (2004).
7 Marginal effects for other years are included in Appendix A.
8 There is a major limitation in Colombian microdata as the family relationship of children is undetermined, except in the case of children with female household headship. This cast some doubt in the identification strategy of the selection model implemented in this paper given the scarcity of good instruments (i.e., highly correlated with the probability of participation but uncorrelated with wages) in the household surveys of this country. Unfortunately, this issue is present in most empirical works of the labour market for this country and it certainly deserves more research.
9 A full set of estimates of the wage equations described in expressions (8) and (9) is presented in Appendix B.
For instance, Badel & Peña (2010), using a quintile regression approach arrive at a similar conclusion based on household survey data for 2006.


REFERENCIAS

Acosta, C. E. & Isaza, J. G. (2006). Gender, poverty and labour market dynamics: a microsimulations analysis for urban Colombia between 1990 and 2004. Social Science Research Network -SSRN. Retrieved from http://ssrn.com/abstract=1281545.         [ Links ]

Anker, R. (1997). Theories of occupational segregation by sex: an overview. International Labour Review, 136, 315.         [ Links ]

Appleton, S., Hoddinott, J. & Krishnan, P. (1999). The Gender Wage Gap in Three African Countries. Economic Development and Cultural Change, 47, 289-312.         [ Links ]

Becker, G. (1971). The economics of discrimination. Chicago: University of Chicago Press.         [ Links ]

Blinder, A. S. (1973). Wage discrimination: reduced form and structural estimates. Journal of Human Resources, 8, 436-455.         [ Links ]

Borghans, L. & Groot, L. (1999). Educational presorting and occupational segregation. Labour Economics, 6, 375-395.         [ Links ]

Bourguignon, F., Fournier, M. & Gurgand, M. (2007). Selection bias corrections based on the multinomial logit model: Monte Carlo comparisons. Journal of Economic Surveys, 21, 174-205.         [ Links ]

Brown, R. S., Moon, M. & Zoloth, B. S. (1980). Incorporating Occupational Attainment in Studies of Male-Female Earnings Differentials. The Journal of Human Resources, 15, 3-28.         [ Links ]

DANE (2000). Encuesta nacional de hogares, ciudades y areas metropolitanas, Manual de Crítica y Codificación, Etapa 108. Bogotá: Dirección Técnica de Estadísticas Básicas, DANE.         [ Links ]

DANE (2004). Metodología de la encuesta de hogares. Bogotá: Dirección Técnica de Estadísticas Básicas, DANE. Recuperado de http://suamox03.dane.gov.co:7778/imgcurso/mettran/indice.html.         [ Links ]

Duncan, O. D. & Duncan, B. (1955). A Methodological analysis of segregation indexes. American Sociological Review, 20, 210-217.         [ Links ]

Edwards, S. (2001). The economics and politics of transition to an open market economy: Colombia Paris. Organisation for Economic Co-Operation and Development OECD.         [ Links ]

Grazier, S. & Sloane, P. J. (2007). Accident risk, gender, family status and occupational choice in the UK. Labour Economics, In Press, Corrected Proof.         [ Links ]

Gyourko, J. & Tracy, J. (1988). An analysis of public-and private-sector wages allowing for endogenous choices of both government and union status. Journal of Labor Economics, 6, 229-253.         [ Links ]

Hakim, C. (1992). Explaining trends in occupational segregation: the measurement, causes, and consequences of the sexual division of labour. European Sociological Review, 8, 127-152.         [ Links ]

Heckman, J. J. (1979). Sample selection bias as a specification error. Econometrica, 47, 153.161.         [ Links ]

Isaza, J. G. (2002) Women workers in Bogota's informal sector: gendered impact of structural adjustment policies in the 1990s. Planeación y Desarrollo 33, 411-468.         [ Links ]

Isaza, J. G. & Reilly, B. (2009). Occupational segregation by gender: an empirical analysis for urban Colombia (1986-2004): Brighton, Department of Economics, University of Sussex.         [ Links ]

Isaza, J. G. (2013). Occupational segregation, gender wage differences and trade reforms: empirical applications for urban Columbia. Doctoral thesis, University of Sussex. Retrieved from: http://sro.sussex.ac.uk/44798/.         [ Links ]

Karmel, T. & Maclachlan, M. (1988). Occupational sex segregation-increasing or decreasing? Economic Record, 64, 187.         [ Links ]

Lee, L.-F. (1983). Generalized econometric models with selectivity. Econometrica, 51, 507-512.         [ Links ]

Liu, P.-W., Zhang, J. & Chong, S.-C. (2004). Occupational segregation and wage differentials between natives and immigrants: evidence from Hong Kong. Journal of Development Economics, 73, 395-413.         [ Links ]

Meng, X. & Miller, P. (1995). Occupational segregation and its impact on gender wage discrimination in China's rural industrial sector. Oxford Economic Papers, 47.         [ Links ]

Miller, P. W. (1987). The wage effect of the occupational segregation of women in Britain. The Economic Journal, 97, 885-896.         [ Links ]

Neuman, S. & Silber, J. G. (1996). Wage discrimination across ethnic groups: Evidence from Israel. Economic Inquiry, 34, 648.         [ Links ]

Nunez, J., Ramírez, J. C. & Cuesta, L. (2005). Determinantes de la Pobreza en Colombia, 1996-2004. Documento CEDE, 1-45.         [ Links ]

Núñez, J. & Sánchez, F.(2002). A dynamic analysis of households decision making in Colombia, 1976-1998: changes in household structure, human capital and its returns, and female labor force participation. Documento CEDE, 15, 1-50.         [ Links ]

Oaxaca, R. (1973). Male-female wage differentials in urban labor markets. International Economic Review, 14, 693-709.         [ Links ]

PNUD (2003). El conflicto, callejón con salida (Informe Nacional de Desarrollo Humano Colombia -2003). Bogotá: Programa de Naciones Unidas para el Desarrollo.         [ Links ]

Reilly, B. (1991). Occupational segregation and selectivity bias in occupational wage equations: an empirical analysis using Irish data. Applied Economics, 23, 1.         [ Links ]

Ribero, R. (2003). Gender dimensions of non-formal employment in Colombia. Documento CEDE, 1-84.         [ Links ]

White, H. (1980). A heteroskedasticity-consistent covariance matrix estimator and a direct test for heteroskedasticity. Econometrica, 48,817-838.         [ Links ]

ANEXOS