Financial Times reports problems with the data in Piketty's book

Dracula posted on 2014-5-24 20:18:35

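Piketty's book has been extremely popular lately and has also drawn a good deal of controversy, but mostly over his predictions about future inequality under capitalism and his policy recommendations. The nearly unanimous view is that his work collecting data is remarkable. For example, the review by Larry Summers that I reposted earlier, while opposing his conclusions, held that his work on the data deserves a Nobel Prize. However, the front-page story in today's Financial Times is that the wealth distribution data Piketty collected have problems. If the errors described here are confirmed, whether deliberate or inadvertent, they would be a major embarrassment for Piketty. Some commentators are already comparing this to last year's Reinhart-Rogoff problem.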

The original article follows.

May 23, 2014 7:00 pm
Thomas Piketty’s exhaustive inequality data turn out to be flawed
By Chris Giles and Ferdinando Giugliano

http://www.ft.com/intl/cms/s/0/c9ce1a54-e281-11e3-89fd-00144feabdc0.html?siteedition=intl#axzz32dHkmr7K

Thomas Piketty is in no doubt that data underpin the conclusions of his best-selling economics book, “Capital in the Twenty-First Century”.

He writes, in the introduction: “Compared with previous works, one reason why this book stands out is that I have made an effort to collect as complete and consistent a set of historical sources as possible in order to study the dynamics of income and wealth distribution over the long run”.

While the conclusions of his work, including his call for an international wealth tax, have stirred controversy among academics, commentators and policy makers, even his critics have generally praised the ambition and quality of the data presented in the text.

Reviewing the book this month, Lord Mervyn King, former governor of the Bank of England, said, “the principal weakness of the book is that the carefully assembled data do not live up to Piketty’s rhetoric about the nature of capitalism”.

The sense of diligence in Professor Piketty’s compilation of trends in wealth is bolstered by an online technical annex and spreadsheets containing the data, with sources.

An investigation by the Financial Times, however, has revealed many unexplained data entries and errors in the figures underlying some of the book’s key charts.

These are sufficiently serious to undermine Prof Piketty’s claim that the share of wealth owned by the richest in society has been rising and “the reason why wealth today is not as unequally distributed as in the past is simply that not enough time has passed since 1945”.

After referring back to the original data sources, the investigation found numerous mistakes in Prof Piketty’s work: simple fat-finger errors of transcription; suboptimal averaging techniques; multiple unexplained adjustments to the numbers; data entries with no sourcing; unexplained use of different time periods; and inconsistent uses of source data.

Together, the flawed data produce long historical trends on wealth inequality that appear more comprehensive than the source data allows, providing spurious support to Prof Piketty’s conclusion that the “central contradiction of capitalism” is the inexorable concentration of wealth among the richest individuals.

Once the data are cleaned and simplified the European results do not show any tendency towards rising wealth inequality after 1970.

The US source data are also too inconsistent to draw a single long series. But when the individual sources are graphed, none of them supports the view that the wealth share of the top 1 per cent has increased in the past few decades. There is some evidence of a rise in the top 10 per cent wealth share since 1970.

The FT uncovered several types of defect.

One apparent example of straightforward transcription error in Prof Piketty’s spreadsheet is the Swedish entry for 1920. The economist appears to have incorrectly copied the data from the 1908 line in the original source.

A second class of problems relates to unexplained alterations of the original source data. Prof Piketty adjusts his own French data on wealth inequality at death to obtain inequality among the living. However, he used a larger adjustment scale for 1910 than for all the other years, without explaining why.

In the UK data, instead of using his source for the wealth of the top 10 per cent population during the 19th century, Prof Piketty inexplicably adds 26 percentage points to the wealth share of the top 1 per cent for 1870 and 28 percentage points for 1810.

A third problem is that when averaging different countries to estimate wealth in Europe, Prof Piketty gives the same weight to Sweden as to France and the UK – even though it only has one-seventh of the population.

There are also inconsistencies with the years chosen for comparison. For Sweden, the academic uses data from 2004 to represent those from 2000, even though the source data itself includes an estimate for 2000.

Prof Piketty’s documents explaining his sources and methods suggest that he uses similar data from death duty records around the world. In fact, he interchanges between such source material and surveys of the living, which often give very different answers. Switching between the two sorts of data series, particularly for the US, is important to his results.

Some of the biggest defects relate to the UK data, where his original sources consistently show very large declines of near 10 percentage points in wealth held by the rich in the highly inflationary 1970s.

Conversely, Prof Piketty shows the super rich held a greater share of wealth by 1980 and the top 10 per cent saw their share fall only 1.5 percentage points.

The official data series that Prof Piketty says he used for the UK after 1980 shows little increase in inequality over the next 30 years, while his figures show a steep rise.

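Below is Piketty's reply to the Financial Times. I think he did not directly address the data errors the Financial Times pointed out, and only stressed that other research has reached conclusions similar to his. Still, compared with other disputes over data in economics (for example those involving Steven Levitt, Daron Acemoglu, and Caroline Hoxby), I think Piketty has shown good grace in discussing his own errors.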

Piketty response to FT data concerns

Dear Chris,

I am happy to see that FT journalists are using the excel files that I have put on line! I would very much appreciate if you could publish this response along with your piece.

Let me first say that the reason why I put all excel files on line, including all the detailed excel formulas about data constructions and adjustments, is precisely because I want to promote an open and transparent debate about these important and sensitive measurement issues (if there was anything to hide, any “fat finger problem”, why would I put everything on line?).

Let me also say that I certainly agree that available data sources on wealth are much less systematic than for income. In fact, one of the main reasons why I am in favor of wealth taxation and automatic exchange of bank information is that this would be a way to develop more financial transparency and more reliable sources of information on wealth dynamics (even if the tax was charged at very low rates, which you might agree with).

For the time being, we have to do with what we have, that is, a very diverse and heterogeneous set of data sources on wealth: historical inheritance declarations and estate tax statistics, scarce property and wealth tax data, and household surveys with self-reported data on wealth (with typically a lot of under-reporting at the top). As I make clear in the book, in the on-line appendix, and in the many technical papers I have published on this topic, one needs to make a number of adjustments to the raw data sources so as to make them more homogenous over time and across countries. I have tried in the context of this book to make the most justified choices and arbitrages about data sources and adjustments. I have no doubt that my historical data series can be improved and will be improved in the future (this is why I put everything on line). In fact, the “World Top Incomes Database” (WTID) is set to become a “World Wealth and Income Database” in the coming years, and we will put on-line updated estimates covering more countries. But I would be very surprised if any of the substantive conclusion about the long run evolution of wealth distributions was much affected by these improvements.

For instance, my US series have already been extended and improved by an important new research paper by Emmanuel Saez (Berkeley) and Gabriel Zucman (LSE). This work was done after my book was written, so unfortunately I could not use it for my book. Saez and Zucman use much more systematic data than I used in my book, especially for the recent period. Also their series are constructed using a completely different data source and methodology (namely, the capitalisation method using capital income flows and income statements by asset class). The main results are available here: http://gabriel-zucman.eu/files/SaezZucman2014Slides.pdf.

As you can see by yourself, their results confirm and reinforce my own findings: the rise in top wealth shares in the US in recent decades has been even larger than what I show in my book.

In the attached graph, I compare their series with the approximate series that I provide in the book. As you can see by yourself, the general historical profiles are very similar. This is exactly what I expect as we collect more data in other countries as well: we will certainly improve upon my series and adjustments (some of which can certainly be discussed), but I don’t think this will have much of an impact on the general findings.

(see also this paper pp. 91-92 of pdf: http://gabriel-zucman.eu/files/PikettyZucman2014HID.pdf)

Finally, let me say that my estimates on wealth concentration do not fully take into account offshore wealth, and are likely to err on the low side. I am certainly not trying to make the picture look darker than it is. As I make clear in chapter 12 of my book (see in particular tables 12.1-12.2), top wealth holders have apparently been rising a lot faster than average wealth in recent decades, at least according to the wealth rankings published in magazines such as Forbes. This is true not only in the US, but also in Britain and at the global level (see attached table). This is not well taken into account by wealth surveys and official statistics, including the recent statistics that were published for Britain. Of course, as I make clear in my book, wealth rankings published by magazines are far from being a perfectly reliable data source. But for the time being, this is what we have, and what we have suggests that the concentration of wealth at the top is rising pretty much everywhere. Of course, if the FT produces statistics and wealth rankings showing the opposite, I would be very interested to see these statistics, and I would be happy to change my conclusion! Please keep me posted.

Best, Thomas

海天 posted on 2014-5-24 21:43:55

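Heh, a couple of days ago someone urged me to look at the piece below, gave me the link and an excerpt, and heaped praise on its author, Martin Feldstein.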

http://online.wsj.com/news/articles/SB10001424052702304081804579557664176917086


Dracula posted on 2014-5-24 22:58:47

海天 posted on 2014-5-24 21:43
Heh, a couple of days ago someone urged me to look at the piece below, gave me the link and an excerpt, and heaped praise on its author, Martin Feldstein.

http://online.wsj.com ...

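Thanks for the recommendation. One of Feldstein's many achievements is his research on the effect of income taxes on taxable income. On this question, his understanding is probably deeper than Piketty's. I have read some speculation about future winners of the Nobel Prize in economics, and Feldstein's name usually comes up in the discussion.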

In addition, Paul Krugman's reaction to the Financial Times article is also quite interesting.

http://krugman.blogs.nytimes.com/2014/05/24/is-piketty-all-wrong/

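When Reinhart-Rogoff ran into trouble last year, Krugman pursued them relentlessly and even suggested that they had deliberately falsified their results to deceive people. Yet now he treats the problems with Piketty's data as inconsequential. There is indeed an element of hypocrisy in his stance.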

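Also, I read in the Economist that when the French edition of Piketty's book came out last year, it drew almost no reaction in France, partly because Piketty had once been an economic adviser to Hollande, and Hollande's somewhat notorious 75% income tax policy was his idea. But a year later the English edition has caused a huge stir in the United States, even though public opinion there is much further to the right than in France. I find that quite interesting too.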


爱吱声's Archiver
