How do Related and Unrelated Varieties Impact Firm Economic Growth in China: An Empirical Analysis with Endogeneity

Abstract

Using a sample of 84,868 Chinese manufacturing firms operating in the period 2006-2013, we analyze whether related and unrelated variety—measured by entropy method, affects firms’ economic performance. Our results show that, with Heckman (1979) two-step sample selection model correcting only for sample-selection bias, unrelated variety has a negative and statistically significant impact. If we also correct the endogeneity of the main explanatory variables—related and unrelated variety, by employing Bartik’s (1991) shift-share approach, we find that they become insignificant. We find a positive effect for related variety and negative effect for unrelated variety only if we consider highly-developed Chinese regions. Finally, we find that related variety has a positive effect only in the case of large firms.

Share and Cite:

Jiang, Y. (2024) How do Related and Unrelated Varieties Impact Firm Economic Growth in China: An Empirical Analysis with Endogeneity. Advances in Applied Sociology, 14, 269-293. doi: 10.4236/aasoci.2024.146019.

1. Introduction

Spatial agglomeration is one of the main determinants of firms’ economic performance and therefore of regional/local economic growth. Since the early 1990s, the focus in the literature has been on the benefits that firms derive from location in an agglomerated area such as an industrial district or a technological cluster. This stream of work generally shows that the geographic concentration of production tends to generate positive returns in terms of firm productivity (Cainelli et al., 2016; Cainelli & Ganau, 2019) , firm innovation propensity (Cainelli & De Liso, 2005; Cainelli, 2008) , and internationalization choices (Cainelli et al., 2014) .

The literature identifies two main types of local externalities: 1) localization and 2) diversification economies (Glaeser et al., 1992) which depend respectively on the Marshall-Arrow-Romer (MAR) model and Jacobs externalities (Glaeser et al., 1992) . The MAR theory suggests that knowledge spillovers occur among firms that belong to the same industry (localization economies) whereas Jacobs externalities (Jacobs, 1969) refer to the knowledge spillovers that occur among firms belonging to different industries (diversification economies).

A more recent strand of work has introduced the notion of related variety into the concept of Jacobs externalities (Frenken et al., 2007) . In this case what matters is not diversification per se but related variety i.e. knowledge spillovers among firms operating in “different but related” sectors. In other words, inter-industry knowledge spillovers, i.e. the cross-fertilization of ideas, knowledge and technologies across industries, occur among sectors with common knowledge and technology bases.

This paper investigates the effect of related and unrelated variety on economic growth of a sample of 84,868 Chinese manufacturing firms in the period 2006-2013. We estimate a firm economic proportional growth equation à la Gibrat which includes these two measures of spatial agglomeration i.e. related and unrelated variety. According to a commonly accepted interpretation of Gibrat’s Law, the growth rate of a firm is independent of its initial size. Thus, Gibrat’s Law allows us to test whether the firm’s initial size has any effect on its subsequent economic growth. We estimate an augmented version of the equation by adding two explanatory variables for agglomeration effects: related and unrelated variety. This allows us to test whether Gibrat’s Law holds, and to investigate the role played by related and unrelated variety in firms’ economic growth.

This paper contributes to the empirical literature on related variety in three ways. First, it is one of the first works to study an emerging economy such as China. During the years considered—2006-2013—China experienced a deep productive and technological transformation, and accelerated economic growth. Second, it is the first analysis of Gibrat’s Law applied to the Chinese economy which also includes agglomeration measures and investigates the effect of related variety on the economic performance of Chinese firms using a large firm level dataset. Third, it takes account of the endogeneity of the main explanatory variables—related and unrelated variety—by employing Bartik’s (1991) shift-share approach. To our knowledge, no previous work on related and unrelated variety addresses the potential endogeneity of these variables.

The paper is organized as follows. In Section 2 we review the literature on Gibrat’s Law and the concept of related variety. Section 3 describes the data and the econometric methodology adopted. Section 4 presents and discusses the main results of our analysis and Section 5 concludes the work.

2. Related Literature

2.1. Gibrat’s Law

Gibrat’s Law or Gibrat’s rule of proportionate growth was proposed first by Gibrat (1931) . The original version hypothesized that, a skewed distribution consisting in a large number of additive and independent variables could be converted into a normal distribution by transforming the initial variables with a logarithmic function. Following work by Mansfield (1962) and Chesher (1979) , Gibrat’s Law began to be used to test proportional growth theory which is the basis of many empirical analyses. The current interpretation of Gibrat’s Law differs slightly from its original version. Today, Gibrat’s Law is understood as stating that a firm’s proportional growth rate is independent of its absolute size, in other words, all firms, small and large, should grow at the same rate. This understanding of Gibrat’s Law was tested first by Mansfield (1962) on the cases of three different industries. Many subsequent studies (Wagner, 1992; Geroski, 1995; Caves, 1998) provide estimates of this equation. However, in many cases the evidence does not support the Law (Reid, 1995; Audretsch, 1995; Harhoff et al., 1998; Weiss, 1998; Audretsch et al., 1999; Calvo, 2006) .

Some authors hypothesize that rejection of the Law is due to the fact that small firms generally have a high probability of dying. Lotti et al. (2003) use quantile regression to show that Gibrat’s Law holds for new entrants; in other words, estimates based on surviving firms might be affected by sample selection bias which tends to amplify the rapid growth of smaller firms.

For clarity, this paper uses Gibrat’s Law as the starting point for an investigation of firm economic growth in China. Based on the common understanding of Gibrat’s Law, we test whether the rate of firms’ economic growth is independent of its initial income level. Gibrat’s Law generally holds if firm growth is independent of determinants such as firm age and size. It assumes that the estimated coefficients of the firm’s initial income level and firm age are not different from zero (Maine et al., 2010) .

Although Gibrat’s Law provides a useful framework to test proportionate growth, it is not able to capture all the determinants of firm income growth. Therefore, in our specifications we incorporate measures for agglomeration externalities to investigate their effects on firm income growth, and whether Gibrat’s Law holds in estimates that include more variables.

2.2. Agglomeration and Related Variety

Analysis of agglomeration economies goes back to Alfred Marshall (1920) and since that time the idea that location within a bounded geographic area has a positive effect on firms’ economic performance has been acknowledged by all economists. In a study of the cutlery and knitwear districts in Sheffield, and the knitwear district in Northampton, Marshall showed that firms located in these agglomerated areas benefitted from advantages, also called externalities, compared to firms operating in the same industries but located in non-agglomerated areas. Specifically, Marshall identified three mechanisms underlying agglomeration-related advantages: 1) concentration of a large number of highly specialized suppliers (input sharing); 2) availability of highly specialized workers (labor matching); and 3) existence of knowledge spillovers among local actors.

These three mechanisms explain why firms located in an agglomerated area, and benefitting from lower production costs, tend to achieve higher productivity with respect to their non-agglomerated counterparts. More recent empirical contributions generally confirmed this positive relationship between agglomeration and firms’ economic performance (e.g. Henderson, 2003; Martin et al., 2011; Cainelli et al., 2016 ), propensity for innovation (Cainelli & De Liso, 2005; Cainelli, 2008) , and internationalization choices (Cainelli et al., 2014) .

The recent empirical work on agglomeration-related advantages is the seminal paper by Glaeser et al. (1992) , which investigates two different forms of agglomeration forces and their effects on urban employment in the USA: the agglomeration advantages associated to the productive specialization of the local industry; and agglomeration advantages associated to the productive diversification in an urban area. The first type, generally measured by the level of productive specialization of an industry in the locality, captures knowledge spillovers among firms operating in the same industry. The idea is that physical proximity among firms belonging to the same industry facilitates the transmission of knowledge, ideas, information, and technologies among economic agents and promotes both knowledge spillovers among firms, and incremental and process innovations. These local externalities are usually described as localization economies.

The second type of agglomeration advantages, also called Jacobs externalities, consist of the knowledge spillovers across different industries located in the same geographic area. The notion of Jacobs externalities was proposed by Jane Jacobs (1969) in the context of American cities. Jacobs (1969) identified urban variety as one of the key mechanisms supporting and promoting the transfer of ideas, information, and knowledge among the different industries in a city/urban area. The cross-fertilization of information, knowledge, and innovation among firms in different industries generates advantages for individual firms and thus positive effects on the local system’s aggregate economic performance. It has been shown that almost 70% of the innovations developed by one industry are used in another sector (Glaeser et al., 1992) . It follows then that an industry located in an area characterized by a high degree of diversification and variety of the productive structure should grow faster thanks to these spillovers. In fact, the industry structure diversity/variety at the regional level promotes exchange and cross-fertilization of information, ideas, and technologies, which in turn, promote radical and product innovation. These local externalities described as diversification economies.

Debate over which of these two agglomeration forces, localization or diversification, contributes most to the economic development of a region/local system has been ongoing for a long time. Despite the large number of papers on this topic, the empirical literature is not conclusive.

Our understanding about the mechanisms underlying knowledge spillovers arising from local industrial diversification has been improved recently. It has been suggested that what matters is not diversification/variety per se but rather related variety. This assumes that knowledge spillovers within a region/local system occur among firms operating in “different but related” sectors. The literature on related variety (Frenken et al., 2007) shows that industrial variety in a locality is not a sufficient condition to guarantee cross-fertilization and the transfer of information and knowledge among different local industries. Technological/productive similarity among the industries located in a bounded geographic area is required in order to materialize these local externalities (Frenken et al., 2007) . The authors suggest that the transfer of information, knowledge, and innovations can occur only among industries with the same or similar technology and knowledge bases. It is suggested also that the differentiated industrial mix in a local system/region can enhance opportunities to interact, copy, modify, and recombine ideas, practices, and technologies across sectors. Geographic proximity among firms makes this process of recombining existing pieces of knowledge in totally new ways more likely to occur. Recombination leads to new products and services. Traditional industries such as footwear are unlikely to transfer knowledge or technologies to high-tech sectors such as the biomedical industry. Thus, transfer and transmission processes are activated only if the cognitive distance among the firms operating in these different local industries is not too large (Nooteboom, 2000; Boschma & Iammarino, 2009) . In their pioneering contribution, Frenken et al. (2007) operationalized the concept of local industrial variety using the entropy measure, and employed a standard statistical classification of industries to identify relatedness among the sectors within an industry. Specifically, they computed related variety as the weighted sum of entropy at the five-digit sector level for each two-digit level industry in a locality, while unrelated variety was operationalized as entropy at the two-digit industry level.

Many empirical studies focus on the relationship between related variety and regional economic growth. The first is Frenken et al. (2007) which analyzes the Dutch case, and shows positive returns from related variety on employment growth, but not productivity or unemployment growth, at the sub-national geographic NUTS3 level. Brachert et al. (2011) show that related variety was one of the main sources of German regional employment growth during the period 2003-2008. They found that related sectors foster regional economic growth. Firgo and Mayerhofer (2018) studied related variety and employment growth using highly disaggregated sub-regional data for Austria level and found that unrelated variety positively affects employment growth. Mameli et al. (2012) take account of sectoral heterogeneity and show that related variety seems to have a stronger influence on knowledge intensive service sectors compared to manufacturing industries. In an analysis of the impact of related variety in the case of Finland, Hartog et al. (2012) found no evidence of employment growth being affected by related variety. However, after decomposing high-tech and low/medium-tech sectors, they found that related variety had a positive impact on employment growth in high-tech sectors. Bishop and Gripaios (2010) argue that in investigating the effect of related variety on regional employment growth, distinguishing between manufacturing and services might be an oversimplification. Since sectors are heterogeneous, the mechanisms behind knowledge spillovers may differ among sectors. A more recent study by Aarstad et al. (2016) proposes a multi-level analysis of Norwegian data and shows that related industrial variety has a positive effect on firm innovation, and that unrelated variety has a negative effect on productivity.

Studies of China’s economy investigating the effects of related and unrelated variety on firm level performance are scarce although Howell et al. (2018) examine the effects of related and unrelated variety on the survival chances of new entrepreneurial firms in China.

3. The Dataset and the Econometric Methodology

3.1. The Dataset

Our study focuses on firm level economic growth of a sample of Chinese manufacturing firms during the period 2006-2013. Our main source of data is the non-listed enterprise database, which is a micro-database at the enterprise level, and is also the most comprehensive enterprise database. The database is covering the basic information for enterprise with an income (in the main business activity) above 5 million of yuan in China, such as enterprise name, address, contact information, etc; financial information for enterprise, including revenue, profit, assets, liabilities and other financial indicators; production information, such as production capacity, output, capacity utilization, etc; labor supply information, such as number of employees and job distribution; industry category information, geographic location information (which region the enterprise is belonging to), ownership information (state-owned, privately owned, foreign-funded, etc), export status information, historical business data and so on. The database is compiled according to the standards published by the Chinese National Bureau of Statistics. It covers a wide range of economic activities such as the extractive and electricity industry, all manufacturing sectors, and gas and water production.

To construct our dataset we selected two years 2006 and 2013. We excluded observations with missing or invalid information on location (district level), industry category (2-digit level, according to the national industrial classification-2003, National industry classification and code (GB/T 4754—2002) are showed in Appendix Table A1), production, sales, and employment. We are interested in firms in operation both in 2006 and 2013. We dropped firms that started operations after 2006 and firms that changed location during the time period.

This left a final sample of 84,868 observations covering almost all 2-digit manufacturing industries from sector 13 to 43 (such as food manufacturing industry, textile industry, furniture manufacturing industry, medical manufacturing industry, paper and paper products industry, etc), except sectors 16 (Tobacco) and 25 (Oil processing, coking and nuclear fuel processing). The 84,868 sample observations are at firm level, with the variables including legal person code for identifying the exact firm, industry code at 2-digit level and industry code at 4-digit level for identifying the industry category of the firm, number of employees, main business income at year 2006, main business income at year 2013, starting operation year, industrial intermediate input, gross output value, value-added of industry, total fixed assets, total current assets, sales costs, sales revenue, province code, city code, district code (for geographic information of the firm) and so on.

3.2. Measuring Related and Unrelated Variety

We use entropy measures to define related and unrelated variety (Frenken et al., 2007) . The entropy measures drawing from the construction of system entropy in information theory, are usually used to measure the disorder or randomness of a system, and typically used to describe the degree of disorder or uncertainty in a system. The calculation formulation is: H ( X ) = p ( x ) log 1 p ( x ) , where X is the state of system, p ( x ) is the probability of being in system state X. The higher probability of being in a specific state means the lower uncertainty of the overall system. The main advantage of entropy measures is that they can be decomposed at each digit sectoral level. The decomposable nature of entropy measure implies that variety can enter the regression without causing collinearity (Jacquemin & Barry, 1979; Attaran, 1986) . Unrelated variety in each district is computed by entropy at the 2-digit level, related variety is computed by the weighted sum of entropy at the 4-digit level within each 2-digit sector.

More formally, let all four-digit sectors g located in district d at time t = 2006, fall exclusively into a two-digit sector j, where j = 1 , , J . The two-digit shares, P j d t , are the sum of four-digit shares p g d t :

P j d t = g j p g d t (1)

The unrelated variety (UV), or entropy at the two-digit level is given by:

U V d t = j = 1 J P j d t log 2 ( 1 P j d t ) (2)

Related variety (RV), the weighted sum of entropy within each two-digit sector is given by:

R V d t = j = 1 J P j d t × [ g j p g d t P j d t log 2 ( 1 p g d t / P j d t ) ] (3)

According to the clarification of how related and unrelated variety are defined and calculated above, we can say the higher value of unrelated variety means a higher degree irrelevant diversification, the higher value of related variety means a higher degree of relevant diversification.

3.3. Econometric Methodology

We investigate the effect of related and unrelated variety on firm level economic growth in China during the period 2006-2013. To our knowledge, this is the first analysis of Gibrat’s Law in the context of the Chinese economy which also includes agglomeration measures. The firm level economic growth equation is defined as follows:

I n c o m e G r o w t h i j d = α + β log ( I n c o m e i j d t ) + ρ a g e i j d t + v V D i j d t + δ log ( p o p d e n s c t ) + γ R V d t + τ U V d t + ϵ p + θ j + ϑ s + ε i j d t (4)

The dependent variable is firm income growth, which is defined (in logs) as: (T = 2013, t = 2006): I n c o m e G r o w t h i j d = ln ( i n c o m e i j d , T ) ln ( i n c o m e i j d , t ) . This variable measures the economic growth of firm i operating in the 2-digit sector j and located in district d between the year t = 2006 and T = 2013. We use the main business income of each firm to proxy for its income level.

In addition to related variety and unrelated variety, we include explanatory variables for: income level in the initial year 2006 (Incomeijdt), firm age (ageijdt) defined as 2006 minus the firm’s start year, and firm’s level of vertical disintegration (VDijdt) defined as the ratio between industrial intermediate input and gross output value in 2006:

V e r t i c a l D i s i n t e g r a t i o n i j d t = p u r c h a s e d i n t e r m e d i a t e i n p u t i j d t g r o s s o u t p u t v a l u e i j d t (5)

To control for geographic heterogeneity in our main specifications we introduce a measure for population density (popdensct) calculated as the population of city c in 2006 per square-kilometer. This variable, which is in logarithm form, represents for urbanization economies.

ϵ p denotes a set of geographic dummies defined at the provincial level to capture systematic differences across geographic areas in terms of natural resources, public infrastructures, social capital, industrialization and policy efficiency. In China administrative divisions include several levels. Provincial is the first level and includes provinces, autonomous regions, municipalities, and special administrative regions. The second level is the prefectural-city level which includes prefecture-level cities and prefectures. In 2019 there are 34 provincial units (23 provinces, 5 autonomous regions, 4 municipalities, and 2 special administrative regions) and 333 prefectural units. Cities are further sub-divided into districts. Central and local governments have different place-based policies such as tax incentives and public subsidies. Natural resources, technology, education, and health systems can vary significantly across regions.

θ j is a set of industry dummies defined at the 2-digit level. They are introduced to control for productive, organizational, and technological differences. The scale theory points out that firm size influences firm economic growth. Therefore, we include ϑ s which is a set of dummy variables, defined according to firm size (measured as number of employees). Specifically, we consider four size dummies: 1) small firms with 0 - 50 employees; 2) medium-small firms with 50 - 95 employees; 3) medium-large firms with 95 - 200 employees; and 4) large firms with over 200 employees.

Table A2 in Appendix shows Firms and employees distribution at province levels, and Table A3 in Appendix show Firms and employees distribution at 2-digit level. Definition for main variables and Correlation matrix of selected explanatory variables are showed in Tables A4-A6 respectively.

3.4. Identification Strategy

There are two main econometric problems related to our estimation procedure. The first concerns the sample selection and the second the endogeneity of our main explanatory variables: related and unrelated variety.

We estimate our firm level income growth equation adopting a Heckman (1979) two-step sample selection model. We observe income-growth only for the sub-sample firms that survived over the period 2006-2013, which means that if we estimate our main equation only for this sub-sample of firms, the OLS estimates would be biased. There is a clear sample-selection problem. To mitigate this problem, we adopt a two-step sample selection method. To capture the non-random survival of firms during years 2006 and 2013, first we estimate a probit regression for firm survival, with a binary dependent variable which takes value 1 if the firm is observed at both the beginning (t = 2006) and the end of the period (T = 2013) and is 0 otherwise. Then we estimate the augmented firm economic growth equation (4) including the inverse Mills ratio. The firm survival is modeled as an unknown non-linear function (Griffith et al., 2009) on firm size, fixed assets, output value, and total profit (all in logs) for the start year of 2006. These firm level characteristics are suitable excluded variables from the economic growth equation since they affect the probability of firm survival. Since the non-linear functional form determining the firm’s exit decision is unknown, we follow Olley and Pakes (1996) and Pavcnik (2002) and adopt a semiparametric specification which approximates the unknown function with a polynomial expansion in firm size, log fixed assets, log output value, and log total profit, and their interactions.

Summing up, the Heckman model is estimated as follows. First, the probit model is estimated for the whole sample; then the inverse Mills ratio (λ) obtained from the selection equation is added to the economic growth equation (4) as an additional regressor to correct for sample selection bias. Finally, the augmented version of the economic growth regression is estimated using OLS on the sub-sample of firms that survived over the 2006-2013 period. The Heckman two-step method provides unbiased and consistent estimates for the growth equation (4).

Although we corrected for sample selection bias, our estimates might still be biased by the (potential) presence of endogeneity of related and unrelated variety. Endogeneity occurs for several reasons including reverse causality. For example, related and unrelated variety might explain firm economic growth, but firm economic growth might induce a “leadership effect”, which might attract other up-stream and down-stream firms to locate around this high-performing firm. This could lead to clustering/agglomeration processes along the supply chain which might generate related and unrelated variety. Also, exogenous shocks can affect firms’ economic performance and regional industrial distribution simultaneously.

To account for the endogeneity problem, we follow the strategy proposed by Autor and Duggan (2003) , a modification of Bartik’s (1991) shift-share approach. The main idea is that in absence of sector specific or local city level shocks each industrial sector at the local level (in our case the district level) will have experienced the same dynamics (in terms of employment) experienced at the national level over the period 2000-2005. In other words, the instrument variables should exclude any shock associated to China joining WTO in 2001 which would be specific to both the industrial sector and the local area. Accordingly, we construct instruments for related and unrelated variety. IVru accounts for sectoral variations at the four-digit level within the same two-digit sector, and is defined as:

I V r v = g = 1 g j G { ( n g d ( t 6 ) g = 1 g j G n g d ( t 6 ) ) [ log ( n g ( d ) ( t 1 ) ) log ( n g ( d ) ( t 6 ) ) ] } (6)

where n g d ( t 6 ) is the number of employees in the four-digit sector g within a two-digit sector j ( g j ), and located in the district d at time t 6 = 2000 ; the terms n g ( d ) ( t 1 ) and n g ( d ) ( t 6 ) denote the number of employees working in a four-digit sector g at the national level excluding the district d, at the times t 1 = 2005 and t 6 = 2000 respectively. We choose year 2000 (before China’s accession to the WTO) to construct the “share” component in order to capture the original state before the shock (joining the WTO). While the “shift” component is defined for the period 2000-2005, considering a year lag (at 2005) to relax the endogeneity issue. The instrument for related variety specified in equation (6) for each four-digit sector g falling within a two-digit sector j, is used to calculate the employment shares in district d. This means that the share referring to year 2000 for each four-digit sector at the local level changes depending on the district considered. Then during the period 2000-2005 these four-digit sector shares multiplied by the change in employment in the same four-digit sector at national level but excluding the reference district d, are summed over the corresponding 4-digit sectors. Thus, the instrumental variable (IV) captures dynamics which are specific to the four-digit sectors within each two-digit sector for each district.

The instrumental variable (IVuv) accounts for the variation at the unrelated two-digit sector level and is defined as follows:

I V u v = j = 1 J { ( n j d ( t 6 ) j = 1 J n j d ( t 6 ) ) [ log ( n j ( d ) ( t 1 ) ) log ( n j ( d ) ( t 6 ) ) ] } (7)

where j = 1 , 2 , , J denotes two-digit sectors. In this case, for each two-digit sector j located in district d, the IV is defined by calculating the shares in term of number of employees working for that specific two-digit sector j and located in district d among all two-digit sectors’ employment within district d. This share of each two-digit sector, defined for the year 2000 (t − 6 = 2000) changes depending on the district d considered. Again, the “shift” term is calculated as the rate of change in terms of number of employees observed for the same two-digit sector and at the national level, but excluding the district d of reference, during period 2000-2005. Then the “shift” part multiplied by the “share” part we mentioned above, are summed over the corresponding 2-digit sectors, which constructs our instrument for unrelated variety, capturing the dynamics which are particular to each two-digit sector and each district.

Finally, we adopt the method proposed by Wooldridge (2010) to solve this estimation addressing sample selection and endogeneity simultaneously. First, we estimate a reduced-form selection equation using a probit model with the set of external instrumental variables (IVrv, IVuv) and the non-linear form exclusion restriction added to the exogenous variables entering Equation (4), and excluding the two endogenous variables (RVjct, UVjct). Second, we estimate firm economic growth equation via a Two-Stage Least Square (2SLS) regression with the inverse Mills ratio obtained from the first-stage selection model as an additional regressor. In addition, we also estimate equation (4) with a Generalized Method of Moments (GMM) approach for a comparison. Standard errors are clustered at the district level in all the specifications, which allows the error term to be correlated across the firms in each district (Bertrand et al., 2004) .

The endogeneity of related and unrelated variety is tested using the Durbin χ2 statistic and the Cragg-Donald Wald F statistic (Wooldridge, 2010) . The null hypothesis is that the variables are exogenous. Our test rejects this hypothesis in the case of both specifications. This means that these two variables are endogenous in the equation. According to the weak identification test, the Wald F statistic is above the rule of thumb value of 10 in all specifications. Thus, in our case we have no weak instruments in our case.

To check whether these results hold for different sized firms, we split the full sample into four sub-samples according to the firm size (already defined). Also we focus on firms located in well-developed regions i.e. the top three Chinese regions based on the value of their gross regional product in the start year 2006.

4. Empirical Results and Findings

4.1. OLS Estimation Results with Correcting the Sample Selection

Table 1 reports the OLS estimates of our income growth equation correcting for the sample selection bias. In all specifications bootstrapped standard errors are clustered at the district level to allow the error term to be correlated across the firms in each district. Column (1) presents the results of our baseline specification without either related or unrelated variety. Columns (2) and (3) respectively show the results for the specifications with related and unrelated variety. Column (4) includes both variables.

Table 1. Sectoral variety and firm economic growth—OLS Method.

Note: *p < 0.1, **p < 0.05, ***p < 0.01. Bootstrapped Standard errors clustered at district level are shown in parentheses. All specifications include provincial geographic dummies and 2-digit industrial dummies and a constant term. lambda denotes the Inverse Mills Ratio from the selection equations.

From the results in Table 1, the inverse Mills ratio (lambda) is negative and statistically significant indicating that we need to correct for sample selection. Besides, column (2) and column (3) shows the strong negative effects of related variety and unrelated variety on firm’s income growth when introducing them in the model respectively. Column (4) suggests only negative effects of unrelated variety on firm’s income growth holds the significance when introducing the both variables, related variety and unrelated variety.

The negative impact of unrelated variety on firm’s income growth, which is in line with previous work about the effects of related and unrelated variety on regional employment growth (Frenken et al., 2007; Saviotti & Frenken, 2008) , could be considered as it may lead to resource fragmentation and can not promote effective knowledge exchange and collaborative innovation. The negative impacts of related variety on firm’s economic growth is different with some previous works (Saviotti & Frenken, 2008; Boschma & Iammarino, 2009; Boschma & Frenken, 2012; Mameli et al., 2012) which related variety usually has positive effects on employment growth, value-added growth and labor productivity. The reason for this could be that related variety may lead to excessive consumption of resources (resource stress), such as land, water resources, energy, etc., which can increase production costs and affect firm’s income growth.

Firm income level at the start year of the analysis has a negative effect on income growth; the estimated coefficients of the variable log(Incomeijdt) are negative and statistically significant in all the specifications, and the estimated elasticities are similar at around 30%. Since the estimated coefficient of the firm income level in the initial year is different from 0, we can state that the Gibrat’s Law does not hold for the Chinese case. Smaller firms grow at a different rate from larger firms. We find that the impacts of firm age and vertical disintegration are negative and significant. This confirms Gibrat’s theory of unproportional increases in firms’ income. Among firms that survived during the financial crisis, the larger the firm the worse their income performance: the estimated coefficients of medium, medium-large and large firms are all negative and highly statistically significant. Moreover, the magnitude of the coefficients increases with firm size. This finding is not surprising. Small firms which survived during the economic crisis generally experienced fewer shocks to their economic performance compared to larger firms. This might be that small firms tend to be less internationalized, and thus less exposed to external shocks than larger firms which are generally more involved in international activities. For this reason, they were less hit by the financial crisis and have had a better economic performance.

4.2. IV Estimation Results with Correcting the Endogeneity Issue

Results of the IV estimation correcting for endogeneity are reported in Table 2. Column (1) reports the results of the IV Two-Stage-Least-Square method, column (2) presents the results of the IV-GMM. Both methods provide quite similar results after 500 replications of the bootstrapped standard errors. The results suggest that after correcting for the endogeneity of related and unrelated variety these variables do not have a significant effect on firm level economic growth.

Table 2. Sectoral variety and firm economic growth—IV Method.

Note: *p < 0.1, **p < 0.05, ***p < 0.01. Bootstrapped standard errors clustered at district level show are in parentheses. All specifications include provincial geographic dummies and 2-digit industrial dummies and a constant term. Lambda denotes the Inverse Mills Ratio from the selection equations.

The estimated negative and significant coefficient for the inverse Mills ratio (lambda) suggests the need to correct for sample selection. We employ the Durbin χ2 statistic to test for endogeneity of related and unrelated variety. Our null hypothesis is rejected: these variables are not exogenous. The values of the Cragg-Donald Wald F statistic are higher than the conservative cut-off value of 10 in both specifications. The sign of the estimated coefficients of unrelated variety remains negative (as in the OLS estimates) and confirms previous studies (Frenken et al., 2007) . The magnitude of these coefficients is larger at around 0.05 compared to 0.025 in the OLS estimations. The negative sign in the former case shows that the OLS estimations are upward biased.

As mentioned before, exogenous shocks can affect firms’ economic performance and regional industrial distribution simultaneously. Thus, after correcting endogeneity with Bartick shift-share approach, the impacts of related and unrelated variety on firm’s income growth become insignificant. Besides, another aspect can be considered to induce this result. The two variables for agglomeration—related and unrelated variety—may be associated with the control variables—geographic dummies and firm size dummies—through the infrastructure level, talent supply, industry chain collaboration, market size and potential, international integration level, and policy supports—China’s economy contains the tradition of government intervention. Industrial policy plays important roles in different stages of China’s economic development, especially in developed regions. For further study, we do the sub-sample analysis for different firm size and for firms in highly developed regions. .

4.3. Robustness Check Results for Different Firm Size and Highly Developed Regions

The results in Table 1 and Table 2 allow us to distinguish between agglomeration externalities generated by different but related sectors and by totally different and unrelated sectors for the whole sample. This allows as to identify which is relatively more important for income growth across the firms in a localized industry. For further robustness check, we do the sub-sample analysis for different firm size and for firms in highly developed regions.

Table 3 reports the IV-TSLS estimates for different size firms. The estimated coefficients vary widely between groups. Inverse Mills ratios are positive for small and medium-small firms, but negative for medium-large and large firms. Related variety is significantly positive only for large firms which can include state-owned firms and subsidiaries of multinational companies. Nevertheless, we can say that these units often have stronger networking, R&D and resource integration capabilities, seem to be more able to absorb knowledge and be better benefit from technological advances from related sectors, as they are able to translate new technologies into innovative products or services more quickly, thus to transform these external flows into higher economic performance. Unrelated variety is not significant for all sub-groups which is in line with the results

Table 3. Sub-sample estimation for firm size heterogeneity (IV-TSLS).

Note: *p < 0.1, **p < 0.05, ***p < 0.01. Standard errors clustered at district level are shown in parentheses. Industry dummy are constructed at 2-digit industrial level, Geographic dummy are constructed at province level.

for the whole sample. Firm level characteristics such as firm income level in the initial year, age, and vertical disintegration are significant for medium-large and large firms.

We ran a sub-sample analysis on the firms in top three most developed regions based on the regional gross product value in 2006. The results are presented in Table 4. The negative and significant coefficients for inverse Mills ratio indicate the need for sample selection. However, in contrast to the result for the whole sample, related variety has a positive and statistically significant impact on firm-level economic growth. The economic interpretation of this result is interesting. The industry related variety in a locality is not a sufficient condition to guarantee cross-fertilization and transfer of information, knowledge, and technologies among different local industries. For the mechanisms underlying

Table 4. Sub-sample estimation for top 3 developed regions (IV-TSLS).

Note: *p < 0.1, **p < 0.05, ***p < 0.01. Standard errors clustered at district level are shown in parentheses. Industry dummy are constructed at 2-digit industrial level, Geographic dummy are constructed at province level.

inter-industry spillovers to work requires that the region is characterized also by a high level of economic and technological development. This finding confirms previous empirical studies which show that related variety is more effective in a high-tech sector context. When we consider highly developed Chinese regions, unrelated variety also becomes significant. We find that unrelated variety has a negative and significant effect on firm economic growth. Finally, firm specific characteristics such as income level in the start year 2006, age, and vertical disintegration are all negative and significant.

5. Conclusion

We used a sample of 84,868 Chinese manufacturing firms during the period 2006-2013 to test Gibrat’s Law based on the joint presence of related and unrelated variety (Frenken et al., 2007) . Our results reject Gibrat’s Law even if we include agglomeration externalities. We analyzed the roles played by related and unrelated variety in firm-level economic growth and found that unrelated variety has a significant and negative effect on firm economic growth if we correct for sample selection bias. After correcting for endogeneity, both related and unrelated varieties were insignificant. We consider this to be an interesting result raising questions about the findings of related variety in studies that do not correct for these econometric problems.

We obtained some interesting results when we disaggregate our dataset according to firm size and region of location. In the case of large firms, related variety seems to have a significant positive effect. This is also true for firms located in highly-developed Chinese regions where related variety significantly and positively influences firm economic performance, while unrelated variety has a negative impact. Confirming previous work we found that local knowledge spillovers are more easily absorbed by firms operating in similar and related sectors. Our results seem to confirm that these types of inter-industry knowledge spillovers are the main drivers of economic growth only among Chinese manufacturing firms in highly-developed regions and not in rural and less-industrialized regions. We showed that large firms, most likely state-owned firms or subsidiaries of multinational companies, seem to have more capacity to transform the opportunities in their local environment into higher economic performance.

Our research has some limitations. First, the analysis is based on a cross-section. Panel datasets would allow us to conduct a more sophisticated analysis of the dynamics of firms’ economic growth by taking account of time lags. Second, our sample period is short, 2006-2013, and includes the period of the Great Recession which coincided also with significant changes of the Chinese economy.

To conclude, we believe that our study contributes to the empirical literature on the relationship between agglomeration and firms’ economic performance in the context of a developing country China. We found no strong effects of related variety and unrelated variety on firm level economic growth after correcting endogeneity. According to our findings, it seems to rely mainly on firm and regional specific characteristics.

Appendix

Table A1. National industry classification and code (GB/T 4754—2002).

Table A2. Firms and employees distribution at provincial level.

Note: This is firm distribution for original dataset.

Table A3. Firms and employees distribution at 2-digit level.

Table A4. Definition for main variables.

Table A5. Correlation matrix of selected explanatory variables.

Table A6. Descriptive statistics of dependent and continuous explanatory variables.

Conflicts of Interest

The author declares no conflicts of interest regarding the publication of this paper.

References

[1] Aarstad, J., Kvitastein, O. A., & Jakobsen, S. E. (2016). Related and Unrelated Variety as Regional Drivers of Enterprise Productivity and Innovation: A Multilevel Study. Research Policy, 45, 844-856.
https://doi.org/10.1016/j.respol.2016.01.013
[2] Attaran, M. (1986). Industrial Diversity and Economic Performance in US Areas. The Annals of Regional Science, 20, 44-54.
https://doi.org/10.1007/BF01287240
[3] Audretsch, D. B. (1995). Innovation, Growth and Survival. International Journal of Industrial Organization, 13, 441-457.
https://doi.org/10.1016/0167-7187(95)00499-8
[4] Audretsch, D. B., Santarelli, E., & Vivarelli, M. (1999). Start-Up Size and Industrial Dynamics: Some Evidence from Italian Manufacturing. International Journal of Industrial Organization, 17, 965-983.
https://doi.org/10.1016/S0167-7187(98)00002-2
[5] Autor, D. H., & Duggan, M. G. (2003). The Rise in the Disability Rolls and the Decline in Unemployment. Quarterly Journal of Economics, 118, 157-206.
https://doi.org/10.1162/00335530360535171
[6] Bartik, T. J. (1991). Who Benefits from State and Local Economic Development Policies? W.E. Upjohn Institute for Employment Research.
https://doi.org/10.17848/9780585223940
[7] Bertrand, M., Duflo, E., & Mullainathan, S. (2004). How Much Should We Trust Differences-in-Differences Estimates? Quarterly Journal of Economics, 119, 249-275.
https://doi.org/10.1162/003355304772839588
[8] Bishop, P., & Gripaios, P. (2010). Spatial Externalities, Relatedness and Sector Employment Growth in Great Britain. Regional Studies, 44, 443-454.
https://doi.org/10.1080/00343400802508810
[9] Boschma, R., & Frenken, K. (2012). Technological Relatedness and Regional Branching. In Beyond Territory (pp. 64-81). Routledge.
[10] Boschma, R., & Immarino, S. (2009). Related Variety, Trade Linkages, and Regional Growth in Italy. Economic Geography, 85, 289-311.
https://doi.org/10.1111/j.1944-8287.2009.01034.x
[11] Brachert, M., Kubis, A., & Titze, M. (2011). Related Variety, Unrelated Variety and Regional Functions: Identifying Sources of Regional Employment Growth in Germany from 2003 to 2008 (No. 15/2011). IWH Discussion Papers.
[12] Cainelli, G. (2008). Industrial Districts: Theoretical and Empirical Insights. In C. Karlsson (Ed.), Handbook of Research on Cluster Theory (pp. 189-202). Edward Elgar Pubblishing.
https://doi.org/10.4337/9781848442849.00019
[13] Cainelli, G., & De Liso, N. (2005). Innovation in Industrial Districts: Evidence from Italy. Industry and Innovation, 12, 383-398.
https://doi.org/10.1080/13662710500195991
[14] Cainelli, G., & Ganau, R. (2019). Related Variety and Firm Heterogeneity. What Really Matters for Short-Run Firm Growth? Entrepreneurship & Regional Development, 31, 768-784.
https://doi.org/10.1080/08985626.2019.1571636
[15] Cainelli, G., Di Maria, E., & Ganau, R. (2014). An Explanation of Firms’ Internationalisation Modes, Blending Firm Heterogeneity and Spatial Agglomeration: Microevidence from Italy. Environment and Planning A, 46, 943-962.
https://doi.org/10.1068/a4690
[16] Cainelli, G., Ganau, R., & Iacobucci, D. (2016). Do Geographic Concentration and Vertically Related Variety Foster Firm Productivity? Micro-Evidence from Italy. Growth and Change, 47, 197-217.
https://doi.org/10.1111/grow.12112
[17] Calvo, J. L. (2006). Testing Gibrat’s Law for Small, Young and Innovating Firms. Small Business Economics, 26, 117-123.
https://doi.org/10.1007/s11187-004-2135-5
[18] Caves, R. E. (1998). Industrial Organization and New Findings on the Turnover and Mobility of Firms. Journal of Economic Literature, 36, 1947-1982.
[19] Chesher, A. (1979). Testing the Law of Proportionate Effect. Journal of Industrial Economics, 27, 403-411.
https://doi.org/10.2307/2097961
[20] Firgo, M., & Mayerhofer, P. (2018). (Un)related Variety and Employment Growth at the Sub-Regional Level. Papers in Regional Science, 97, 519-547.
https://doi.org/10.1111/pirs.12276
[21] Frenken, K., Van Oort, F., & Verburg, T. (2007). Related Variety, Unrelated Variety and Regional Economic Growth. Regional Studies, 41, 685-697.
https://doi.org/10.1080/00343400601120296
[22] Geroski, P. A. (1995). What Do We Know about Entry? International Journal of Industrial Organization, 13, 421-440.
https://doi.org/10.1016/0167-7187(95)00498-X
[23] Gibrat, R. (1931). Les Inégalits économiques. Sirey.
[24] Glaeser E. L., Kallal H. D., Scheinkman J. A., & Shleifer A. (1992). Growth in Cities. Journal of Political Economy, 100, 1126-1152.
https://doi.org/10.1086/261856
[25] Griffith, R., Redding, S., & Simpson, H. (2009). Technological Catchup and Geographic Proximity. Journal of Regional Science, 49, 689-720.
https://doi.org/10.1111/j.1467-9787.2009.00630.x
[26] Harhoff, D., Stahl, K., & Woywode, M. (1998). Legal Form, Growth and Exit of West German Firms—Empirical Results for Manufacturing, Construction, Trade and Service Industries. Journal of Industrial Economics, 46, 453-488.
https://doi.org/10.1111/1467-6451.00083
[27] Hartog, M., Boschma, R., & Sotarauta, M. (2012). The Impact of Related Variety on Regional Employment Growth in Finland 1993-2006: High-Tech versus Medium/Low-Tech. Industry and Innovation, 19, 459-476.
https://doi.org/10.1080/13662716.2012.718874
[28] Heckman, J. J. (1979). Sample Selection Bias as a Specification Error. Econometrica, 47, 153-161.
https://doi.org/10.2307/1912352
[29] Henderson, J. V. (2003). Marshall’s Scale Economies. Journal of Urban Economics, 53, 1-28.
https://doi.org/10.1016/S0094-1190(02)00505-3
[30] Howell, A., He, C., Yang, R., & Fan, C. C. (2018). Agglomeration, (un)-Related Variety and New Firm Survival in China: Do Local Subsidies Matter? Papers in Regional Science, 97, 485-500.
https://doi.org/10.1111/pirs.12269
[31] Jacobs, J. (1969). The Economy of Cities. Vintage Books.
[32] Jacquemin, A. P., & Berry, C. H. (1979). Entropy Measure of Diversification and Corporate Growth. The Journal of Industrial Economics, 27, 359-369.
https://doi.org/10.2307/2097958
[33] Lotti, F., Santarelli, E., & Vivarelli, M. (2003). Does Gibrat’s Law Hold among Young, Small Firms? Journal of Evolutionary Economics, 13, 213-235.
https://doi.org/10.1007/s00191-003-0153-0
[34] Maine, E. M., Shapiro, D. M., & Vining, A. R. (2010). The Role of Clustering in the Growth of New Technology-Based Firms. Small Business Economics, 34, 127-146.
https://doi.org/10.1007/s11187-008-9104-3
[35] Mameli, F., Iammarino, S., & Boschma, R. (2012). Regional Variety and Employment Growth in Italian Labour Market Areas: Services versus Manufacturing Industries. CIMR Research Working Paper.
[36] Mansfield, E. (1962). Entry, Gibrat’s Law, Innovation, and the Growth of Firms. American Economic Review, 52, 1023-1051.
[37] Marshall, A. (1920). The Economics of Industry. Macmillan and Company.
[38] Martin, P., Mayer, T., & Mayneris, F. (2011). Spatial Concentration and Plant-Level Productivity. Journal of Urban Economics, 69, 182-195.
https://doi.org/10.1016/j.jue.2010.09.002
[39] Nooteboom, B. (2000). Learning and Innovation in Organizations and Economies. Oxford University Press.
https://doi.org/10.1093/acprof:oso/9780199241002.001.0001
[40] Olley, S., & Pakes, A. (1996). The Dynamics of Productivity in the Telecommunications Equipment Industry. Econometrica, 64, 1263-1297.
https://doi.org/10.2307/2171831
[41] Pavcnik, N. (2002). Trade Liberalization, Exit, and Productivity Improvements: Evidence from Chilean Plants. Review of Economic Studies, 69, 245-276.
https://doi.org/10.1111/1467-937X.00205
[42] Reid, G. C. (1995). Early Life-Cycle Behaviour of Micro-Firms in Scotland. Small Business Economics, 7, 89-95.
https://doi.org/10.1007/BF01108684
[43] Saviotti, P. P., & Frenken, K. (2008). Export Variety and the Economic Performance of Countries. Journal of Evolutionary Economics, 18, 201-218.
https://doi.org/10.1007/s00191-007-0081-5
[44] Wagner, J. (1992). Firm Size, Firm Growth, and Persistence of Chance: Testing GIBRAT’s Law with Establishment Data from Lower Saxony, 1978-1989. Small Business Economics, 4, 125-131.
https://doi.org/10.1007/BF00389853
[45] Weiss, C. R. (1998). Size, Growth, and Survival in the Upper Austrian Farm Sector. Small Business Economics, 10, 305-312.
https://doi.org/10.1023/A:1007972518380
[46] Wooldridge, J. M. (2010). Econometric Analysis of Cross Section and Panel Data. MIT Press.

Copyright © 2024 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.