CreamScan Research Evaluation Standards
Intro
Fundamental conflict of interest – does cosmetic science explore or sell?
There is a discussion even among cosmetic chemists on a matter of how scientific a cosmetic science is today and should be.Here are a few quotes from one of the leaders of cosmetic chemistry industry, Dr. Johann W. Wiechers from his book “Memories of a Cosmetically Disturbed Mind” (22)."… Especially because cosmetic science is a commercial science. Why? Academic science is done for finding out how 'things' work, and to test or deny a hypothesis, whereas commercial science is science done for finding reasons for selling your product."
"A large proportion of us are doing science to sell products and not to explain things. Cosmetic science is therefore very often comparative science (how good is my product or my ingredient doing relative towards my competitor’s product or ingredient) instead of explanatory science (how does my product work).”So we conclude here that, cosmetic science, being an integral part of cosmetic industry, focused to sell new molecules to manufacturers or products to the end consumers, by default, has a built-in bias to demonstrate their efficacy.
Objectives
Does a particular research paper prove that an active ingredient reduced wrinkles?
If yes, by how many percent?
1. Study should confirm efficacy on humans, not animals
2. Topical application of emulsion, no injections, microneedles, no oral intake or combined
2. In-vivo, not ex vivo or in vitro
“The problem is that usually these lab studies do not translate to positive effects when used in real-life products. Just because an ingredient shows a benefit when applied directly to human skin cells in a petri dish does not mean it will have any effect when delivered directly to the skin from a lotion or other personal care product.”
3. At least 20 people
4. Vehicle (placebo) or positive controlled
Looking study by study we confirm, that out of 12 studies, in 11 studies (90%+) vehicle only had a positive effect on wrinkles (3-11).
In 6 studies out of 12, vehicle had a stronger effect on wrinkles, than active only, calculated as the result of "vehicle + active formulation minus result of vehicle only formulation (3-8).
On average, vehicle only (=moisturizer) could improve wrinkles by 12%. Vehicle + active could improve wrinkles by 19%. That means that on average, the effect of the vehicle itself (12% improvement) was 1.5 times stronger than an average effect of actives isolated from vehicle (7% improvement).
How the basic moisturizer can works against wrinkles?
We will discuss it in more details elsewhere to elaborate the fact that a basic moisturizer is the most efficient part of the anti-wrinkle strategy.The key hypothesis is that “use of skin moisturizers should delay or slow down the rate of persistent wrinkling, perhaps by plasticizing the stratum corneum (upper skin layer), thereby diminishing the formation of temporary wrinkles during facial expression and thus the potential for persistent wrinkles. Beyond prevention, lessening the repeated mechanical stress may even allow repair of persistent wrinkles that have already formed” – that was a one of the conclusions in a 8 year (!) study to find out how expression lines transform into persistent wrinkles (18) also confirmed by other studies. Now let's go back now to the anti-wrinkle efficacy. How can we isolate the effect of the active from the effect of the vehicle (placebo) moisturizer? There is only one way to do it. Compare the results of vehicle + active vs. vehicle alone.So studies claiming that a cream with a magic ingredient improved wrinkles by x% vs baseline, are inconclusive, if they don't compare a treatment formulation to the same formulation, but without an active: vehicle (placebo). Because the wrinkle reduction effect can be caused not by a reviewed active ingredient, but by other moisturizing ingredients present in the formula.Positive control
A few words about positive control. We see it as a possible, although less reliable alternative to a vehicle control. It is less reliable because in a double-blind study, in case with vehicle (placebo) control, a researcher doesn't know what result he looks at, a vehicle or an active, so he has no room for potential bias.In case with positive control, however, a researcher, even if blinded, has room for bias. Because to prove that the reviewed ingredient has the same efficacy as positive control, both formulas can be given the same efficacy evaluation, in case the subjective evaluation method is used (like expert grading of wrinkle severity). For that a researcher doesn't need to know what result he is looking at, he just may give the same evaluation to all the results and at the end we can claim that ingredient X has the same efficacy as tretinoin, for example.Therefore we only take into consideration positive control, if there is no subjective method used to evaluate the results of the participants, but only an objective and quantitative method like computer grading or profilometry.Positive Control Ingredient Should Be Present at Effective Concentrations
Another strategy employed by manufacturers involves using an positive control ingredient with proven efficacy, such as retinol, but at a concentration significantly below the effective range confirmed by other clinical studies. The rationale behind this approach is clear- to claim that the researched ingredient has “the same” efficacy as the proven one and to make statements such as: “In a Preliminary Study on Human Subjects, a Cosmetic Cream Containing a Harungana madagascariensis Plant Extract Induces Similar Anti-Aging Effects to a Retinol-Containing Cream” (24).
In the study (24) arranged by Clarins, two formulations were compared: one containing Harungana madagascariensis Plant Extract, a botanical ingredient used in the well-known Clarins Double Serum, and the other serving as a positive control with 0.05% retinol, a recognized potent anti-wrinkle ingredient. According to the study, both formulations demonstrated similar anti-wrinkle efficacy.
However, the issue arises from the lack of evidence supporting the efficacy of retinol at a concentration of 0.05%, as demonstrated by vehicle-controlled studies. Clinical studies confirm the efficacy of retinol within a concentration range of 0.1% to 1%, with a clear positive correlation between concentration and efficacy.
It is likely that such a low concentration of retinol was intentionally selected to "prove" that Harungana madagascariensis Plant Extract possesses equivalent anti-wrinkle efficacy to retinol, as stated in the conclusions of the research paper. In reality, the reported effects on wrinkles could likely be attributed purely to the emollient/moisturizing properties of both formulations.
Comparability
The last part is about comparability of a vehicle or positive control. As we said above, the whole point of a vehicle control is to exclude the anti-wrinkle effect of other ingredients in a formula. The only way to achieve that is to compare the same formulations, one with active and another one without it, or, in case with a positive control, with another active.We assume that such an approach is always the case, even if exact active and vehicle formulations are not reported. But in some studies it's obvious that the vehicle is different from the active formulation not only in active ingredient content.For example in a key study for anti-wrinkle efficacy of adenosine (20), it is said “A placebo cream known from previous studies to have no effect on wrinkles was used as control.” Even if ignore the fact that this previous study is not specified here, why do we think that using a completely different vehicle formulation (even with proven zero efficacy on the wrinkles) is fundamentally wrong?Because it's not about the efficacy of a vehicle, we agree here that it can be zero. It's about the efficacy of other ingredients in a formulation containing 0.1% of adenosine. How do we know that this formulation reduced wrinkles (no doubt about it) because of adenosine and not because of glycerin, mineral oil and other moisturizing ingredients it could have contained?There is no way to know that unless we also see the results of exactly the same formula with glycerin, mineral oil but without adenosine.It may be obvious that the active formulation and a vehicle should be used the same way, the same number of times per day, on the same part of the face (in case of the spit face study, on the symmetrical part of the face).But again, we see studies where it's not the case: otherwise a perfect study, that met all our criteria, confirming the anti-wrinkle efficacy of bakuchiol (21) on par with retinol, failed on delivering the equal and comparable conditions for an active and its positive control. According to the study design “the subjects were instructed either apply 0.5% retinol cream to their full face nightly or the 0.5% bakuchiol cream to their full face twice daily as a thin layer.”So we compare here 2 exactly the same formulas with only difference of active ingredients, which is great. But why one should be used twice a day and the other one just once? Is it because of bakuchiol that bakuchiol containing formula achieved the same results as the retinol one or because it was used twice more often and delivered better moisturizing reducing wrinkles? No way to know…In conclusion, we found that around a half of the studies reviewed, claiming anti-wrinkle efficacy are not vehicle/positive controlled. We don't talk here about rare ingredients but actually about the well known ones like trendy peptides or ascorbic acid derivatives. So that was a big exclusion factor.5. Double-blind
6. Randomized
7. Monotherapy - only one active with known concentration per formulation
For example, in a vehicle-controlled, double-blind, randomized study for Copper Tripeptide-1 (23) that meets all other criteria, the concentration of the peptide used was not reported.
Normally, peptides are used in tiny quantities, so we could have assumed a concentration of 0.1%. However, if a larger concentration was used, it would not be a fair and correct assumption.
8. Conclusive results of the study
Results should be measured by a researcher, not by a participant. Method of evaluation can be objective (profilometry, computer image evaluation) or subjective, in this case by trained experts/dermatologists (wrinkle severity gradual evaluation), but we don't take into consideration studies where participants' self-assessment of wrinkle improvement is the only result reported.
As Johann W. Wiechers wrote in his book Memories of a Cosmetically Disturbed Mind “Questionnaires can be leading, sometimes even misleading. Tell a person s/he is testing antiwrinkle products and any product will work (to some extent).” (22)
Result should clearly report wrinkle reduction: various parameters like wrinkle depth, surface roughness (Ra, Rz), wrinkle volume, clinical assessment score improvement, etc. Some studies just report a percentage of participants that improved wrinkling or overall photodamage score.
Result should be reported as mean score of a group of all participants and not just for some individual participants. It is not rare when effects are reported for a few subjects and then they are called “typical”.
Result should be reported as percentage of wrinkle reduction vs baseline or we should be able to calculate it with the data reported. It's often a case when results are reported as change of a certain wrinkle parameter vs. baseline (before treatment) in absolute value without precising what baseline value was. In this case we understand that there was a certain effect, we may even make a conclusion this effect was superior to vehicle (placebo), but we can't understand how significant it was.
For example, if a study reports that average skin roughness improved by 10 points, it can be a significant 20% improvement if the baseline value before treatment was 50 or it can be only 5% improvement if the baseline value was 200.
Both vehicle and active ingredient formulation numbers are reported. We need to understand how the active formula performed vs. vehicle (see why in Chapter 4). Often, in a vehicle -controlled studies, some results are reported only for an active formulation. Results of vehicle (placebo) either are not reported at all or reported without any numbers saying something like “It (active) was statistically significant while vehicle was not”.
It doesn't necessarily mean that the researched ingredient is ineffective, probably it showed some level of efficacy, it's just we don't know what exactly, based on the data provided - because a researcher team didn't make it transparent when reported the results.
The other examples when data is reported only for active treatment in a format “improvement vs. baseline vs. placebo”. It may seem a valid format, but there is an important nuance and we've seen it in the studies.
Let's say an active formula reduced wrinkles by 5%. At the same time subjects using placebo (vehicle) formulation, got their wrinkles worse by 10%. It's quite unusual as normally vehicle (placebo) has a positive effect on wrinkles, but in a few studies we've seen this case, probably because of the low moisturizing properties of the vehicle.
So the researchers motivated to demonstrate the efficacy of an active treatment, report that an active has decreased the wrinkles by 15% vs. vehicle, which is mathematically correct. It's just the wrinkle were reduced only by 5% vs. the baseline of the active formulation and that's the number we should take as the real efficacy result of this ingredient. That's why we need to see a transparent reporting of both results for active and vehicle.
9. Peer reviewed
Conclusion
"Caucasian female subjects (n = 50, aged 40–60 years) participated in a 12-week, double-blind, placebo-controlled, split-face, left-right randomized clinical study assessing two topical products: moisturizer control product versus the same moisturizer product containing 5% niacinamide. Niacinamide was well tolerated by the skin and provided significant improvements versus control in end points evaluated previously: fine lines/wrinkles, hyperpigmentation spots, texture, and red blotchiness."
We reviewed more than a hundred ingredients marketed by a skincare industry and raw material manufacturers as having anti-wrinkle effects or potential. Only 11 ingredients have scientific papers that qualify to this rule and we can say yes – they have a proven efficacy confirmed by a robust research.
About the author
References
Khneizer et al. Placebo Use in Medicine: Use, Abuse & Therapy. Mo Med. 2022 Nov-Dec; 119(6): 524–528. PMCID: PMC9762224. PMID: 36588633
Natalia K. Spierings. Evidence for the Efficacy of Over-the-counter Vitamin A Cosmetic Products in the Improvement of Facial Skin Aging: A Systematic Review.
Green et al. A clinicopathological study of the effects of topical retinyl propionate cream in skin photoageing. Clin Exp Dermatol. 1998 Jul;23(4):162-7. PMID: 9894360
Robinson et al. Topical palmitoyl pentapeptide provides improvement in photoaged human facial skin. Int J Cosmet Sci. 2005 Jun;27(3):155-60. PMID: 18492182
Lee et al. Evaluation of the effects of a preparation containing asiaticoside on periocular wrinkles of human volunteers. Int J Cosmet Sci. 2008 Jun;30(3):167-73. PMID: 18452433
Humbert et al. Topical ascorbic acid on photoaged skin. Clinical, topographical and ultrastructural evaluation: double-blind study vs. placebo. Exp Dermatol. 2003 Jun;12(3):237-44. PMID: 12823436
Heinrich et al. In vivo assessment of Ectoin: a randomized, vehicle-controlled clinical trial. Randomized Controlled Trial Skin Pharmacol Physiol. 2007;20(4):211-8. PMID: 17519560
Badenhorst et al. Effects of GHK-Cu on MMP and TIMP Expression, Collagen and Elastin Production, and Facial Wrinkle Parameters.
Grove et al. Skin replica analysis of photodamaged skin after therapy with tretinoin emollient cream. J Am Acad Dermatol. 1991 Aug;25(2 Pt 1):231-7. PMID: 1918458
Creidi et al. Profilometric evaluation of photodamage after topical retinaldehyde and retinoic acid treatment. J Am Acad Dermatol. 1998 Dec;39(6):960-5. PMID: 9843009
Leyden et al. Treatment of photodamaged facial skin with topical tretinoin.
Bissett et al. Topical niacinamide reduces yellowing, wrinkling, red blotchiness, and hyperpigmented spots in aging facial skin. Int J Cosmet Sci. 2004 Oct;26(5):231-8. PMID: 1849213
Kawada et al. Evaluation of anti-wrinkle effects of a novel cosmetic containing niacinamide. J Dermatol. 2008 Oct;35(10):637-42. PMID: 19017042
Mariko et al. Evaluation of the anti-wrinkle effect of a lipophilic pro-vitamin C derivative, tetra-isopalmitoyl ascorbic acid. J Cosmet Dermatol. 2022 Aug;21(8):3503-3514. PMID: 34910367
Weinstein et al. Topical tretinoin for treatment of photodamaged skin. A multicenter study. Arch Dermatol. 1991 May;127(5):659-65. PMID: 2024983
Tucker-Samaras et al. A stabilized 0.1% retinol facial moisturizer improves the appearance of photodamaged skin in an eight-week, double-blind, vehicle-controlled study. TJ Drugs Dermatol. 2009 Oct;8(10):932-6. PMID: 19852122
Lowe et al. Topical tretinoin improves the appearance of photo damaged skin. Australas J Dermatol. 1994;35(1):1-9. PMID: 7998893
Hillebrand GG, Liang Z, Yan X, Yoshii T. New wrinkles on wrinkling: an 8-year longitudinal study on the progression of expression lines into persistent wrinkles. Br J Dermatol 2010; 162: 1233–1241.
Kwon et al. The Effect of Glycerin, Hyaluronic Acid and Silicone Oil on the Hydration, Moisturization and Transepidermal Water Loss in Human Skin.
Abella. Evaluation of anti-wrinkle efficacy of adenosine-containing products using the FOITS technique. Int J Cosmet Sci. 2006 Dec;28(6):447-51.PMID: 18489289
S Dhaliwal et al. Prospective, randomized, double-blind assessment of topical bakuchiol and retinol for facial photoageing. Br J Dermatol. 2019 Feb;180(2):289-296. PMID: 29947134
Johann W. Wiechers. Memories of a Cosmetically Disturbed Mind. Alluredbooks, January 1, 2013. ASIN: B00CY1EC0W
Badenhorst et al. Effects of GHK-Cu on MMP and TIMP Expression, Collagen and Elastin Production, and Facial Wrinkle Parameters.
Gibielle et al. In a Preliminary Study on Human Subjects, a Cosmetic Cream Containing a Harungana madagascariensis Plant Extract Induces Similar Anti-Aging Effects to a Retinol-Containing Cream. Clin Cosmet Investig Dermatol. 2023; 16: 1051–1058. PMID: 37155516
Bissett et al. Topical niacinamide reduces yellowing, wrinkling, red blotchiness, and hyperpigmented spots in aging facial skin. Int J Cosmet Sci. 2004 Oct;26(5):231-8. PMID: 1849213