{"id":1487,"date":"2016-03-05T15:14:55","date_gmt":"2016-03-05T20:14:55","guid":{"rendered":"http:\/\/whiskyanalysis.com\/?p=1487"},"modified":"2016-06-22T14:50:46","modified_gmt":"2016-06-22T18:50:46","slug":"so-you-want-to-be-a-whisky-reviewer","status":"publish","type":"post","link":"https:\/\/whiskyanalysis.com\/index.php\/2016\/03\/05\/so-you-want-to-be-a-whisky-reviewer\/","title":{"rendered":"So You Want to Be a Whisky Reviewer &#8230;"},"content":{"rendered":"<p>One of the first questions that comes up when someone is considering becoming a product reviewer is whether or not to provide a score &#8211; and if so, over what sort of range?<\/p>\n<p>As discussed on my <a href=\"http:\/\/whiskyanalysis.com\/index.php\/interesting-correlations\/interpretation-understanding-flavour-commentaries\/\" target=\"_blank\">Flavour Commentaries<\/a> page, providing a score or rating is hardly required in a product review. I personally avoid doing this in my <a href=\"http:\/\/flashlightreviews.ca\" target=\"_blank\">flashlight reviewing<\/a> (in part because technology is always advancing there). But if you are interested in scoring, you might find the personal observations (and data) from integrating whisky reviewer scores on this site interesting.<\/p>\n<p><strong>Scoring Systems<\/strong><\/p>\n<p>Most whisky reviewers tend to provide some sort of quality ranking. As explained on my <a href=\"http:\/\/whiskyanalysis.com\/index.php\/methodology-introduction\/methodology-understanding-reviewer-scoring\/\" target=\"_blank\">Understanding Reviewer Scoring<\/a> page, at its heart scoring is simply a way to <em>rank<\/em> the relative quality of all the products a given reviewer has sampled. As long as you are only looking within the catalog of reviews of that one reviewer, it doesn&#8217;t necessarily matter what category labels they are using for their rank.<\/p>\n<p>A numerical score from 1-100? Fine. Star ratings from 1 to 5, with half-stars? No problem. Six gradations of recommended levels? Sure. Kumquats widths from 2.1cm to 3.3cm in 0.25cm increments? Okay, if that floats your boat. Personally, I&#8217;d love to see someone review to base hexadecimal (&#8220;Man, this limited edition is much better than the regular OB version &#8211; I&#8217;ll have give it an 0E&#8221;).<\/p>\n<p>One problem with the diversity of scoring systems is that it may be hard to get a feel for how items compare to each other for a given reviewer &#8211; until you go through her whole catalog of reviews. Similarly, it would be hard to integrate the reviews of multiple reviewers on a given site (or across sites). This has led to some consolidated approaches for standardization. In the liquor industry, probably the most popular one is that developed by Robert Parker for <a href=\"https:\/\/www.erobertparker.com\/info\/legend.asp\" target=\"_blank\">scoring wines<\/a>.<\/p>\n<p>In this system, all wines receive a numerical whole number score between 50 and 100. The presumption is that anything below 50 is unfit for human consumption (i.e., swill). 50-59 is not recommended. 60-69 is below average. 70 to 79 is average. 80 to 89 is above average. 90 to 95 is outstanding. And 96-100 is extraordinary (and rare).<\/p>\n<p>The benefit to this system is it is fairly easy to understand and relate to. Unfortunately, it still leads to a lot of variation in interpretation by different individuals &#8211; as shown graphically for whisky reviewers on my <a href=\"http:\/\/whiskyanalysis.com\/index.php\/methodology-introduction\/methodology-understanding-reviewer-scoring\/\" target=\"_blank\">Understanding Reviewer Scoring<\/a> page.<\/p>\n<p>Still, if you were starting out as a reviewer, this isn&#8217;t a bad system to work from, as it provides a recognizable structure. But fundamentally, it is no better or worse than any other scoring system. From the perspective of someone running a meta-critic integration site, I can tell you it doesn&#8217;t really matter what you choose to use as scores\/labels &#8211; what really matters is your consistency in using them.<\/p>\n<p><strong>Score distributions<\/strong><\/p>\n<p>Consistency of scoring actually encompasses a number of things. Is the reviewer applying scores in as fair a manner as possible across categories? Would the same product get the same score if sampled on another occasion?\u00a0 In other words, is the reviewer showing good <em>internal consistency<\/em> in their scoring?<\/p>\n<p>Few reviewers do repeated testing of the same sample (and almost none with <a href=\"https:\/\/en.wikipedia.org\/wiki\/Blind_experiment\" target=\"_blank\">blinding<\/a>), so it is hard to know. Whiskies are also subject to considerable batch variations (for some of the reasons <a href=\"http:\/\/whiskyanalysis.com\/index.php\/background\/scotch-style-whiskies-single-malts-vs-blends\/\" target=\"_blank\">discussed here<\/a>), which further complicates matters if the repeated sampling is done on different batches. I recommend you check out my <a href=\"http:\/\/whiskyanalysis.com\/index.php\/methodology-introduction\/methodology-biases-and-limitations\/\" target=\"_blank\">Review Biases and Limitations<\/a> page a discussion of some of the common pitfalls here.<\/p>\n<p>But one way to address this consistency issue in the aggregate is to compare the distribution pattern of scores across reviewers. This is part of the larger <a href=\"http:\/\/whiskyanalysis.com\/index.php\/methodology-introduction\/methodology-metacritic-score-construction\/\" target=\"_blank\">correlational<\/a> analyses that I did in building the Meta-Critic database.<\/p>\n<p>The key points that I want to share here &#8211; as a guide for newcommers to reviewing &#8211; are:<\/p>\n<ul>\n<li>whisky reviewers do not hand out scores in an evenly distributed manner<\/li>\n<li>whisky reviewers are fairly consistent in how they deviate from a normal distribution<\/li>\n<\/ul>\n<p>The above is true of all the whisky reviewers examined here, including those ostensibly using the Parker wine scoring scheme. As explained on my <a href=\"http:\/\/whiskyanalysis.com\/index.php\/methodology-introduction\/methodology-understanding-reviewer-scoring\/\" target=\"_blank\">Understanding Reviewer Scoring<\/a> page, all reviewers <em>skew left<\/em> in their distributions. This is shown graphically below in the frequency histogram of the Meta-Critic scores:<\/p>\n<p><img decoding=\"async\" class=\"size-full wp-image-1418 aligncenter lazyload\" data-src=\"http:\/\/whiskyanalysis.com\/wp\/wp-content\/uploads\/2016\/02\/Distribution-MCAverage.gif\" alt=\"Distribution-MCAverage\" width=\"650\" height=\"445\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 650px; --smush-placeholder-aspect-ratio: 650\/445;\" \/><\/p>\n<p>In essence, you can interpret this distribution as pretty close to what the &#8220;average&#8221; or typical reviewer in my dataset looks like.\u00a0 Again, see that earlier page for some examples of actual reviewers.<\/p>\n<p>Note that I choose to present the Meta-Critic score using a standard scientific notation of one significant digit to the left of the decimal. Those who remember using <a href=\"http:\/\/www.sliderule.ca\/\" target=\"_blank\">slide rules<\/a> will be able to relate. \ud83d\ude42\u00a0 Just multiply everything by 10 if you want to know what it would look like on the Parker scale.<\/p>\n<p>Below are the current actual descriptive characteristics of the Meta-Critic score distribution.<\/p>\n<p>Mean: 8.53<br \/>\nMedian: 8.58<br \/>\nStandard Deviation: 0.41<br \/>\nSkewness: -0.63<br \/>\nMinimum: 6.93<br \/>\nMaximum: 9.52<\/p>\n<p>While the Parker scoring system provides a nice idealized normal distribution in theory (i.e., min of 50, max of 100 and an average of 75) &#8211; in practice most reviewers deviate from it considerably.\u00a0 I suspect <a href=\"https:\/\/en.wikipedia.org\/wiki\/Grade_inflation\" target=\"_blank\">grade inflation<\/a> has a lot to do with this, along with a desire to please readers\/suppliers.\u00a0 But whatever the reasons, it is a common observation that all whisky reviewers seem to fit the above pattern.<\/p>\n<p>So if you are starting out as a reviewer, you may want to consider trying to match your scores to a similar distribution &#8211; just so that your readers will have an easier time understanding your reviews in the context of others out there. Of course, nothing is stopping you from breaking the mold and going your own way.\u00a0 \ud83d\ude09<\/p>\n<p><strong>Range of Whiskies<\/strong><\/p>\n<p>The other thing I see a lot is reviewers &#8220;revising&#8221; their score range over time &#8211; which can be a problem if they have a lot of old scores to &#8220;correct&#8221;.<\/p>\n<p>The source of the problem seems to be a sampling bias when they start out reviewing, and have limited experience of only budget to mid-range products. As they start reviewing higher-end products, they realize they are too &#8220;squished&#8221; in their scoring to be properly proportional.\u00a0 For example, if you start out giving one of the most ubiquitous (and cheap) single malts like the Glenlivet 12 a 90+ score, that doesn&#8217;t leave you much room to maneuver as you start sampling higher quality single malts.<\/p>\n<p>To help new reviewers calibrate themselves, here are how some of the more common expressions typically fall within the Meta-Critic Score, broken down by general category. Note that I&#8217;m not suggesting you bias your scores by what the consensus thinks below &#8211; but I just want to give you an idea of what the general range is out there for common whiskies that you are likely to have tried.<\/p>\n<p><strong>~7.5 whiskies<\/strong><br \/>\nBourbon-like: Jim Beam White, Rebel Yell, Ancient Age<br \/>\nRye-like: Crown Royal, Canadian Club<br \/>\nScotch-blend-like: Johnnie Walker Red, Cutty Sark, Ballantine&#8217;s Finest, Famous Grouse<br \/>\nSingle-Malt-like: (there aren&#8217;t many that score this low)<\/p>\n<p><strong>~8.0 whiskies<\/strong><br \/>\nBourbon-like: Jack Daniels&#8217;s Old No. 7, Jim Beam Devil&#8217;s Cut, Wild Turkey 81<br \/>\nRye-like: Royal Canadian Small Batch, Gibson&#8217;s Finest 12yo, Templeton Rye<br \/>\nScotch-blend-like: Chivas Regal 12yo, Jameson Irish Whiskey, Teacher&#8217;s Highland Cream, Black Grouse<br \/>\nSingle-Malt-like: Glenfiddich 12yo, Glenlivet 12yo, Glenrothes Select Reserve, Tomatin 12yo<\/p>\n<p><strong>~8.5 whiskies<\/strong><br \/>\nBourbon-like: Wild Turkey 101, Basil Hayden\u2019s, Bulleit Bourbon, Four Roses Small Batch<br \/>\nRye-like: Knob Creek Small Batch Rye, Canadian Club 100% Rye, George Dickel Rye, Forty Creek Barrel Select<br \/>\nScotch-blend-like: Johnnie Walker Blue, Johnnie Walker Black, Green Spot, T\u00e9 Bheag<br \/>\nSingle-Malt-like: Old Pulteney 12yo, Glenmorangie 10yo, Dalmore 12yo, Ardmore Traditional Cask<\/p>\n<p><strong>~9.0 whiskies<\/strong><br \/>\nBourbon-like: Russell&#8217;s Reserve, Maker&#8217;s Mark 46, Booker&#8217;s Small Batch, W.L. Weller 12yo<br \/>\nRye-like: Lot 40, Masterson&#8217;s Straight Rye 10yo, Whistlepig 10yo<br \/>\nScotch-blend-like: (Not much makes it up to here, maybe Ballantine&#8217;s 17yo, Powers 12yo John&#8217;s Lane)<br \/>\nSingle-Malt-like: Aberlour A&#8217;Bunadh, Amrut Fusion, Ardbeg 10yo, Talisker 10yo<\/p>\n<p><strong>Close to ~9.5 whiskies<\/strong><br \/>\nBourbon-like: Various Pappy van Winkles, some BTACs, George T. Stagg<br \/>\nRye-like: High West Midwinter Night\u2019s Dram Rye (closest Canadians: Wiser&#8217;s Legacy, Gibson&#8217;s 18yo)<br \/>\nScotch-blend-like: nada<br \/>\nSingle-Malt-like: Lagavulin 16yo, Brora 30yo, Caol Ila 30yo, Redbreast 21yo<\/p>\n<p>As an aside, you may notice that some whisky categories get consistently higher or lower scores than others. As a result, I suggest you try to avoid directly comparing scores across categories (e.g. bourbons vs single malts), but focus instead on internal consistency within categories. This is why the <a href=\"http:\/\/whiskyanalysis.com\/index.php\/database\/\" target=\"_blank\">Whisky Database<\/a> is sorted by default by general category (and then flavour profile, if available), before sorting by score.<\/p>\n<p><img decoding=\"async\" class=\"alignright size-full wp-image-1499 lazyload\" data-src=\"http:\/\/whiskyanalysis.com\/wp\/wp-content\/uploads\/2016\/03\/Kumquat.jpg\" alt=\"Kumquat\" width=\"400\" height=\"278\" data-srcset=\"https:\/\/whiskyanalysis.com\/wp\/wp-content\/uploads\/2016\/03\/Kumquat.jpg 400w, https:\/\/whiskyanalysis.com\/wp\/wp-content\/uploads\/2016\/03\/Kumquat-300x209.jpg 300w\" data-sizes=\"(max-width: 400px) 100vw, 400px\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 400px; --smush-placeholder-aspect-ratio: 400\/278;\" \/>Again, the above is just a way to help you calibrate yourself against the &#8220;typical&#8221; reviewer (as expressed by the Meta-Critic score).\u00a0 Nothing is stopping you from going your own way.<\/p>\n<p>But if anyone does decide to use kumquat widths as category labels, please drop me a line &#8211; I&#8217;d love to hear about it. \ud83d\ude42<\/p>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>One of the first questions that comes up when someone is considering becoming a product reviewer is whether or not to provide a score &#8211; and if so, over what sort of range? As discussed on my Flavour Commentaries page, providing a score or rating is hardly required in a product review. I personally avoid doing this in my flashlight<\/p>\n","protected":false},"author":1,"featured_media":1497,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[4,15],"tags":[6],"class_list":["post-1487","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-stats","category-whisky-resources","tag-statistics"],"_links":{"self":[{"href":"https:\/\/whiskyanalysis.com\/index.php\/wp-json\/wp\/v2\/posts\/1487","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/whiskyanalysis.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/whiskyanalysis.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/whiskyanalysis.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/whiskyanalysis.com\/index.php\/wp-json\/wp\/v2\/comments?post=1487"}],"version-history":[{"count":14,"href":"https:\/\/whiskyanalysis.com\/index.php\/wp-json\/wp\/v2\/posts\/1487\/revisions"}],"predecessor-version":[{"id":2081,"href":"https:\/\/whiskyanalysis.com\/index.php\/wp-json\/wp\/v2\/posts\/1487\/revisions\/2081"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/whiskyanalysis.com\/index.php\/wp-json\/wp\/v2\/media\/1497"}],"wp:attachment":[{"href":"https:\/\/whiskyanalysis.com\/index.php\/wp-json\/wp\/v2\/media?parent=1487"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/whiskyanalysis.com\/index.php\/wp-json\/wp\/v2\/categories?post=1487"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/whiskyanalysis.com\/index.php\/wp-json\/wp\/v2\/tags?post=1487"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}