Trust me, I am a Robo-advisor

Scherer, Bernd; Lehner, Sebastian

doi:10.1057/s41260-022-00284-y

Trust me, I am a Robo-advisor

Original Article
Published: 29 October 2022

Volume 24, pages 85–96, (2023)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Journal of Asset Management Aims and scope Submit manuscript

Trust me, I am a Robo-advisor

Download PDF

1585 Accesses
3 Citations
Explore all metrics

Abstract

This paper offers cross-sectional and data-intensive insights into Robo-advisory portfolio structures. For this purpose, we scrape portfolio recommendations for 16 German Robo-advisors. Our sample accounts for about 78% of assets in the German Robo-advisory market. We analyze about 243.000 pairs of recommended portfolios and their corresponding client characteristics. Our results show that current Robo-advice offers limited individualization. Variables that matter in modern portfolio choice like the amount and nature (beta) of human capital or shadow assets are largely ignored. Instead, portfolio recommendations are designed to meet investor preconceptions or the regulator’s understanding of portfolio choice. While ensuring consumer trust and regulatory approval makes business sense, it also limits the economic benefits of Robo-advisors.\(^1\)

Quant Models for Robo-Advisors

What Do Robo-Advisors Recommend? - An Analysis of Portfolio Structure, Performance and Risk

Robo Advisors: quantitative methods inside the robots

Article Open access 27 September 2018

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Robo-advisory firms promise to provide low-cost access to diversified portfolios built following the academic literature on normative portfolio choice. Their competitive advantage is based on the ability to provide cheap access to diversified and customized beta (in modern words: financial inclusion). Customization should come at little marginal costs for a web-based platform. Traditional financial advisors have a poor track record for taking client characteristics into account. Foerster et al. (2017) find that only 12% of the cross-sectional variation in advice (across clients) arises from differences in client characteristics such as risk aversion, wealth, experience, occupation or time horizon. Mullainathan et al. (2012) show that advisors are systematically biased against passive investments and even ignore stated client preferences. Traditional financial advice suffers from agency conflicts and behavioural biases.^{Footnote 1} It is also costly (high fixed costs) and might not be available to investors with little wealth. This is often viewed as a major reason for household non-participation in financial markets.

All the above favours Robo-advice over traditional advice. However, Robo-advisor firms suffer from one key vulnerability: the difficulty of creating trust. To deflect this weakness, they make particular design choices. They offer passive funds and ETFs as well as automated portfolio solutions to avoid conflicts of interest (and save production costs). What else can Robo-advisors do to create trust? We believe that the low level of individualization in Robo-advice critically raised by Faloon and Scherer (2017)is not a design flaw but a deliberate design choice to create trust by offering familiar solutions close to popular investment rules.

Our paper offers statistical insights into portfolio recommendations for the German Robo-advisory market by web-scraping 16 Robo-advisors with a cumulative market share of 78%. We find little evidence for individualization of portfolio advice as investor heterogeneity arising from different investor balance sheets or differences in amount and characteristic (market or factor return) of investor human capital are largely ignored. Robo-advisors fail to offer the advice Merton (1971) gave exactly 50 years ago: allocate between speculative demand (frontier portfolios, identical to all investors), cash, and various hedging demands reflecting household balance sheets and exposures to systematic economic risks (different across individual investors). We believe these choices are not made because of ignorance of the existing academic literature but for commercial reasons. Complicated models that can deliver contra-intuitive solutions to the financially untrained client will not maximize revenues in a highly competitive market.

The existing literature on Robo-advice lacks cross-sectional evidence on empirical portfolio structures. Due to the lack of data, most papers review the economics of the industry as in Soehnke et al. (2020), Grealish and Kolm (2021) or Torno et al. (2021), while Puhle (2019) looks at the relative performance of different Robo-advisors. Scherer and Lehner (2021) are closest to us in methodology but only scrape a single “representative” US advisor. Even though they extract more than 150.000 portfolios, it is unclear how their results generalize in the cross section. Torno and Schildmann (2020) also analyze a large cross section of Robo-advisors (36), but rely on six different model customers. This leaves them with 216 (6 times 36) recommended portfolios instead of more than 240000 in our setting. This contrasts with our approach, where each data point represents a unique combination of questionnaire inputs and portfolio recommendations. We have as many different customers as we have data points. In addition, our focus on a single jurisdiction (identical regulatory framework and client preferences) results in the first data-intensive, cross-sectional study on portfolio structures offered by Robo-advisory firms. Finally, Tertilt and Scholz (2018) also investigate the question how different questionaire answers relate to recommended equity allocations. The authors document that many questions asked in questionnaires have no impact on portfolio recommendations. They use a similar set of Robo-advisors but rely on bivariate correlations (between recommendation and questionnaire input) without controlling for other questionnaire items, use a limited sample of variations rather that all possible permutations and do not attempt to answer the question of which set of questions are the most influential (variable importance relative to all other variables).

Our paper is structured as follows. Section 2 describes the sample of Robo-advisors involved in our empirical work and as a summary of questionnaire information required from potential customers. In Sect. 3, we describe the set of portfolios offered by each Robo-advisor and discuss whether these portfolios are consistent with modeling client circumstances from first principles. We then link questionnaire themes (e.g. time horizon, wealth, experience, ... ) with normative portfolio choice theory in order to a assess the importance of each question on the cross section of portfolio recommendations in Sect. 4. Section 5 describes our empirical strategy and presents the main results. We conclude in Sect. 6.

Robo-advisors and questionnairs

The Robo-advisory market in Germany is highly fragmented with about 30 competing firms.^{Footnote 2} The initial list of firms included Bevestor, Cominvest, Easyfolio, Evergreen, Fidelity, Financery, Fintego, Gerd Kommer Invest, Ginmon, Growney, Investify, Invoya, Liqid, Loni, Minveo, My si, Navigator, Onvest, Oskar, Pax-Bank, Pixit, Peaks, Peningar, Quirion, Raisin, Robin, Scalable Capital, Solidvest, Pixit, Truevest, Visualvest, Vividam, Whitebox, Zeedin. We only include advisors that can be systematically scrapped, i.e. we checked each Robo-advisor to see if it was possible to use a script programmed in Python to fill out the questionnaire that leads to a portfolio recommendation. For this purpose, we used one of two methods:

1.
API (application programming interface): For communication with the web server, we used the direct programming interface. That means we send a POST request to the Robo-Advisor server, which is normally sent by the web browser. POST means that the server accepts the data contained in the request message, in this case the predefined input parameters. The response from the server was a portfolio recommendation.
2.
Python library selenium Selenium. It opens a browser window that can be controlled by another Python script. The questionnaire is accessible through certain fields in the source code of the website using the xpath method. The result is the same as if we filled in the questionnaire by hand.

All Robo-advisors that allow scraping were included in the sample. This resulted in our focus list of 16 advisors summarized in Table 1.

How representative is our data for the German market in terms of Assets under management (AuM)? AuM numbers are notoriously difficult to get with many firms being very reluctant to share their numbers. This is not surprising as low AuM numbers signal low customer levels of trust in a given advisor. All AuM data are estimates derived from public sources. Where we did not find sources, we follow Deloitte (2016) and assumed 50 million ï?‘œ AuM as a default, as this represents the minium size to breakeven from the Rob-advisor’s objective. In summary, we cover 8.1 billion in AuM. This leads to a total market size of 11.2 billion by adding the 18 mandates that could not get scrapped. We assume they have on average the same size as the list of firms in table 1. However for the purpose of building this average, we removed the three largest Robo-advisors from our sample as it is highly unlikely that any of these advisors have similar AuMs. This leads to an average size of 0.169 billion for the remainder of the market. Under these assumptions, we cover 73% (\(\frac{8.1}{8.1+18\cdot 0.169}\)) of the German Robo-advisory market.

Table 1 German market for Robo-advice We display name, assets under management, start date and website

Full size table

All of the Robo-advisors examined use a similar web-based questionnaire to gather the relevant information for portfolio modelling. The questions result in variables that are comparable across all advisors. Robo-advisors have asked very few questions outside these categories.Where they have been asked they have been insignificant and outside the most influential factors. Stylized questionnaire information is summarized in table 2. We report the specific topic of a given question, the major theme it belongs to, its typical number of variations, data type and the number of advisors that ask a particular question. Not all advisors ask all or the same questions. The number of answer categories also differs. The only question that is common across all 16 examined Robo-advisors asks for the investment amount. This information is irrelevant for investors with constant relative risk aversion (these investors find that the optimal allocation to risky assets is the same independent of the investment amount or level).^{Footnote 3} Investor information required for each Robo-advisor is fairly generic and hardly personalized. This is consistent with Beketov et al. (2018), who find that Robo-advisors use naive mean-variance portfolio construction. No data to assess the client’s household balance sheet or human capital is collected. While we could derive a proxy for human capital from the monthly income figure, we would need many strong assumptions about the (average) investor’s age, profession or expected wage growth. This limits the ability to customize solutions, but potential clients might feel these questions are too intrusive and time-consuming to enter into a website. Time horizon and risk-aversion-related questions are also very common among advisors. However, risk-averse investors with a 10-year time horizon are not a homogeneous group that deserves to be lumped together to receive identical portfolios.

Table 2 Input data to questionnaire Answers to the questionnaire are stored in the following variables. We report the specific topic of a given question, the major theme it belongs to, typical variations, data type and the number of advisors that ask a particular question

Full size table

Efficient sets

What is the investment opportunity set offered by Robo-advisors? Table 3 summarizes our data set. We analyze 243.000 generic portfolio recommendations and their associated client characteristics across 16 German Robo-advisors. The data are gathered from the 1st of June to the 23rd of June 2021. To facilitate comparisons across Robo-advisors, we document the percentage of input combinations that result in allocations across 10 equity exposure bins. Equity allocations do not only contain equities. They contain all non-bond assets, i.e. equities, alternatives, real estate and commodities when offered.

We find that most (12 out of 16) Robo-advisors offer a parsimonious choice set of 10 or fewer portfolios. The remaining four advisors offer 11 79 or 19 portfolios. This does not only limit the scope for customization, it also shows at most very basic digitization. We suspect that all portfolios are pre-build rather than continuously created for each input combination. Existing Robo-advice comes in a tin. We interpret this as evidence for a scoring logic on top of an efficient frontier, rather than portfolio choice modeling with varying inputs from first principles.

Input combinations that lead to extreme allocations (100% equities or 100% bonds) are much less frequent than portfolios that carry intermediate risk. We view this as a safeguard against litigation risk. Corner portfolios are only offered if overwhelming user input justifies solutions that could be labeled as extreme (i.e. not diversified). In many cases, extreme portfolios are not even on offer. Only five Robo-advisors recommend an all-equity portfolio, while only one Robo-advisor recommends an all bond portfolio. The latter is at least in line with normative portfolio choice that demands minimum equity participation across all levels of risk aversion. Full (100%) bond allocations might also result in unattractive fees relative to return expectations in a low-interest rate environment. For example, fixed costs of 100 Euros would require an asset manager to charge 2% fees for a 5000 Euro account to merely break even. At the same time, most 10-year bonds in 2021 display negative yields in Euro (under either covered or uncovered interest rate parity).

Finally, we note that the extreme variation in investment opportunity sets will make it unlikely that two Robo-advisors recommend similar portfolios when faced with the same inputs. In most cases, this is no even feasible.

Table 3 Efficient set Recommended portfolio allocations for risky assets and their relative frequency. For each Robo-advisor, we compute the weight in risky assets (equities plus commodities) count their frequency with respect to 10 exposure bins ranging from 0-10% to 90-100% equities

Full size table

Questionnaires and portfolio theory

Each question in a given questionnaire is viewed as a potential explanatory variable in a multivariate regression model. Compulsory inputs should be useful in determining portfolio allocations. In an empirical model they should explain at least some of the variation in portfolio recommendations across clients with different personal characteristics. Therefore, we use the available questionnaire information to build a quantitative model to measure each question’s impact on final portfolio recommendations. Every answer to a question is stored as either an ordered factor (example: risk aversion of 1 is smaller than risk aversion of 2) or an unordered factor (example: investment goals, as no goal is larger than another goal). We group the required inputs from Robo-advisory questionnaires into five categories related to portfolio choice: risk aversion, wealth, time horizon, experience and investment goals, as shown in table 2. Before we present our results, we quickly summarize what to expect from the perspective of normative portfolio choice.

Time horizon and wealth^{Footnote 4} Normative portfolio choice allows multiple theoretical relationships. The classical view (time does not diversify) has been forcefully argued by Samuelson (1969) and reiterated to the investment community in Samuelson (1994). Samuelson’s solution (time horizon and recommended equity weights are independent) is well known to rely on the assumptions of CRRA utility, independent returns and lack of estimation risk. Once we change these assumptions, we can argue either case. If we change from CRRA to DRRA (decreasing relative risk aversion) the optimal allocation to equities increases with wealth^{Footnote 5} Equally, Campbell and Viceira (2002) argue for an increase in equity allocations as time horizons lengthen. Their work is driven by the predictability of equity returns using vector-autoregressive models. There is however, considerable estimation risk in regressions of this kind and previous relationships can be overturned (optimal allocation to equities decreases with time horizon) once we add substantial estimation risk (Barberis 2000). Empirically, Spaenjers and Spira (2015) find that the share of risky assets increases with the investor’s subjective (personal, i.e. mortality table adjusted) time horizon. Bodie and Crane (1997) also find that empirically the allocation to equities increases with time horizon and wealth. In our judgment, the work by Campbell and Viceira (2002) now define the academic mainstream. We view a positive relationship between time-horizon and risk-taking and no relation between risk-taking and wealth as most consistent with normative portfolio choice.

Experience The influence of investor knowledge and personal experience on risk-taking has not been subject to normative models of portfolio choice. Instead, empirical studies document a positive statistical relation between investor education and chosen portfolio risk (after controlling for wealth, and other characteristics).^{Footnote 6} The conjecture is that less cognitive ability might act as a psychological barrier to financial market participation. Unfamiliarity with a complex subject such as investing also increases costs (measured in time and money) for low-skill households and hence leads to lower levels of investment. Ampudia and Ehrmann (2014) show, that while experience has an impact on risk- taking, it is not experience per se, but the type of experience that matters. Investors with positive (negative) stock market experience are more likely to hold substantial (small) positions in risky assets. Grinblatt et al. (2011) show that cognitive skills decrease information costs and therefore increase the likelihood of participating in financial markets. Campbell (2006)finds evidence that stock market participation positively correlates with education. Hsu (2012) also argues that lower skills lead to lower wealth accumulation. If households also display decreasing relative risk aversion, optimal demand for risky assets will decrease with wealth levels as local risk aversion increases. However, this does not equate to normative advice. Rather to the contrary. Van Rooij et al. (2011) also find that a lack of financial literacy leads to lower stock market participation. From a normative perspective, we would not think that risk-taking depends on investor experience. From an empirical perspective, we would expect lower education to lead to lower risk-taking. Nudging inexperienced households to invest more aggressively than they initially desire would create economic gains for those households at the expense of regulatory and litigation risks.^{Footnote 7}

Goals. Questions concerning investment goals are behaviorally motivated but do not necessarily violate normative portfolio choice. Das et al. (2010) have shown that even though goal-based investing (building mental accounts) is behaviorally motivated, the portfolio of mental accounts plot close on the efficient-frontier. Proponents of goal-based investing will claim that investment goals differ in the required funding strategy to reach them. Bond allocations are optimal if the difference between current wealth and target wealth is low, the time horizon is short, and the required confidence is high. Equity allocations in turn are chosen for large differences in aspired to current wealth, little (high) required confidence and shorter (longer) horizons. Minimizing the probability of falling short of the funds needed to reach the respective goal is the implied measure of risk. Translated into our questionnaire, emergency funds are mainly invested in fixed income, while long-term or retirement objectives are best reached with equities. In our experience, this view has support among practitioners. Among academics, this is however disputed. The measurement of investment risk as the probability to underperform a wealth target is inconsistent with maximizing expected utility for well-accepted utility functions. In a mean-variance world, this has no consequences for efficient frontier portfolios. Mean-variance efficient portfolio sets also are mean-shortfall risk efficient (even though investors might choose different points along the mean-variance frontier). In reality, the world is non-normal, investors are not agnostic by how much a goal is not met and the combination of goal-based portfolios is not necessarily optimal in the presence of a long-only restriction.^{Footnote 8} In our view, any dominance of goal-based criteria would mark a deviation from normative portfolio choice.

Risk aversion Among the many inputs required from Robo-advisors questions, related to risk aversion should have the most direct influence on risk-taking. Higher risk aversion will lead to lower equity allocation. This is not only enshrined in normative portfolio choice but also meets regulatory demands for suitability criteria. We expect a negative relation, i.e. higher risk aversion leads to lower risk-taking.

What drives Robo-advice?

We established that Robo– advisors use similar, but still heterogeneous questionnaires. They differ in the number of variables, exact wording, number of variations available for each question, etc. This makes it difficult to summarize the impact of a given variable across Robo-advisors. We therefore chose the following approach.

1.
We run a separate parametric OLS regression with ordered (if applicable) factors as independent variables for each Robo-advisor. The dependent variable is the recommended equity allocation. In line with the literature we do not attempt to compute and add the implied equity allocation from other asset classes (for example high yield or corporate bond equity beta) to the recommended equity allocations. As we need to deal with mostly ordered factors, we can not use one-hot encoding or Helmert contrasts in our regressions but rather use orthogonal polynomial contrasts. A more detailed description of our modelling approach can be found in the "Appendix".
2.
We formally interrogate each regression model to identify the most influential variable(s). For this purpose, we borrow from the literature on interpretable machine learning and employ the following model agnostic algorithm suggested by Fisher et al. (2018). For each variable, we randomly permute the values of that particular feature and recompute the chosen performance metric, in our case \(R_{\rm perm}^{2}\). We then record the difference between the baseline metric and the permutated metric \(R_{\rm base}^{2}-R_{\rm perm}^{2}\) as our importance score.
3.
The three variables with the highest importance scores are then selected as the most influential variables. We then report the category a variable has been assigned to, together with the sign of their individual regression coefficient as well as the cumulative \({\bar{R}}^{2}\) from stepwise regressions. This gives us an indication of the importance of the modeled relationship. We confirm the direction of the relationship with partial dependence plot.

All results are presented in Table 4. Risk aversion-related questions play a dominating role for recommended equity portfolio weights across all Robo-advisors. For 12 advisors we find that the top input is related to risk aversion. The sign is negative across all advisors, i.e. higher risk aversion leads to lower weights in risky assets. Bach et al. (2020) show that risk-taking (revealed risk aversion) is a major driver of cross-sectional differences in household wealth. The top 1% of wealthiest households take more systematic risks, invest in more volatile portfolios and earn much higher long-term average returns. Investors need to carefully assess their willingness to take risks. We also find that recommended portfolios show higher equity allocations for longer time horizon investors while wealth hardly plays a role in portfolio recommendations. Only three Robo-advisors display statistically significant coefficients for wealth and in each of these cases the marginal R-square of the wealth variable turns out to be small. This makes it unlikely that Robo-advisors use utility functions with decreasing relative risk aversion. Instead, the evidence is more consistent with negatively sloping term structures of risk due to mean reversion in equity returns.

For 3 of our 16 Robo-advisors, we find that investor experience is used as the most important input variable. This is surprising given the weak theoretical underpinning of this variable. We attribute this observation to anticipated regulatory concerns, i.e. mitigation of business risks. MiFID II, article 25(2) requires investment firms to ask investors for their “knowledge and experience in the investment field relevant to the specific product or service”. This question is of interest as it finds no resemblance to the theory of portfolio choice. ESMA’s request is instead based on an implied conjecture: less experience should result in less risk-taking. Their guideline on certain aspects of the MiFID II suitability requirements (50) explicitly states “Firms should be alert to any relevant contradictions between different pieces of information collected, and contact the client to resolve any material potential inconsistencies or inaccuracies. Examples of such contradictions are clients who have little knowledge or experience and an aggressive attitude to risk, or who have a prudent risk profile and ambitious investment objectives.”^{Footnote 9}

Investment goals have a minor impact for all but one advisor, where investment goals explain 77% of the variation in recommended portfolio weights. We also find 3 advisors with an extremely simple model that is fully captured by changes in risk aversion only. All other variables are stored and used for non-investment purposes.

Table 4 Top 3 questionnaire categories For each Robo-advisor we run an OLS-regression with ordered and unordered factors (user input choices). Input variables are one by one randomized such that we can compute an importance score as the difference between the \(R^{2}\) of the original data and the randomized data. The larger the difference, the more important the variable. We show the top 3 variables (by category), their cumulative R-squared as well as the R-squared of a model using all variables

Full size table

Most regressions do not fully explain the dispersion in recommended equity weights. This is a clear indication of possible nonlinearities, i.e either nonlinear interactions across explanatory variables or threshold effects in individual variables. The latter is somewhat caught by the employed polynomial contrast used in our regression framework. Our average \({\bar{R}}^{2}\) is still around 82% and in virtually all regressions we do not find evidence that using more than three variables would significantly increase the model’s explanatory power. In other words, not all required by questionnaires has an impact on the final recommendation.

Our data show that current Robo-advisory offerings use inputs designed to locate investors on a given efficient frontier, while the frontier itself looks identical to all investors. What makes investors different are their various hedging demand originating from their household balance sheets but the relevant questions needed to model hedging demands are not asked. This is in stark contrast to portfolio choice in a modern multi-factor world as in Cochrane (1999). In a multifactor world, many investors will hold portfolios plotting below an efficient frontier as they can not take frontier-related factor risks. This is the whole point of a rational risk premium. Not every investor finds it optimal to take it. Investor heterogeneity arising from different investor balance sheets or differences in amount and characteristic (market or factor return) is largely ignored. This is somewhat disappointing as Robo-advisors fail to offer the advice Merton (1971) gave exactly 50 years ago: allocate between speculative demand (frontier portfolios, identical to all investors), cash and various hedging demands reflecting household balance sheets and exposures to systematic economic risks (different across individual investors).

We believe these choices are not made because of ignorance of the existing academic literature, but rather for commercial reasons. First, it is well known that trusted advice by “money doctors” as described by Gennaioli et al. (2015) reduces behavioral biases and can overcome complexity.^{Footnote 10} Earlier work by Sapienza et al. (2013) also finds the importance of trust for economic decision making. This statement is echoed by Merton (2017) in the context of Robo-advisory adoption rates: “What you need to make technology work is to create trust”. Hildebrand and Bergner (2020) make the same point.

But what creates trust? Jacovi et al. (2020) conjecture that (intrinsic) trust can be gained when recommendations line up closely with the user’s prior beliefs.^{Footnote 11} Hence portfolio recommendations receive more trust when they resemble solutions that coincide with the investor’s prior understanding of portfolio choice. For Robo-advisory as a business, there is likely a tradeoff between Merton (1971) and Merton (2017). Should the Robo-advisor offer theoretically consistent but initially unintuitive advice? A young government employee (assume 90% of his wealth is human capital that behaves like government bonds) with high risk aversion might still get a 100% equity portfolio. This is consistent as equities still only account for 10% of her total wealth. However, will the investor understand? Equally important, would that argument work in court after clients made large losses inconsistent with their stated risk aversion? Robo-advice as a business decides what works best in order to win and maintain new clients. Related work by Scherer and Lehner (2021) already provide evidence in this direction. Web-scrapping one of the largest US Rob-advisors, they document portfolio recommendations that are more consistent with client pre-perceptions rather than textbook financial modeling.

Conclusions

We estimate the impact of client characteristics gathered by Robo-advisor questionnaires on recommended portfolio structures for a large cross section of German Robo-advisors. Contrary to the academic progress on normative portfolio choice, we find that portfolio recommendations are driven mainly by questions with respect to risk aversion and investor time-horizon. Household balance sheets, human capital or economic hedging demands play no role. Instead, variables with little normative underpinning like personal experience or investor goals find their way into questionnaires or as Cochrane (2021) put it: “When theory is so persistently contrary to practice one of the two must be wrong”. Maybe the theory is just incomplete. The fact that Robo-advisors prefer a solution space that is more likely to confirm investors existing preconceptions makes business sense. It increases consumer trust and regulatory approval, both outside the scope of normative portfolio choice. Agency problems are everywhere.

Notes

See Beketov et al. (2018) or Bhattacharya et al. (2012). Hoechle et al. (2017) instead find financial advice helps to overcome behavioral biases.
See https://www.Robo-advisor.de from June 2021.
CRRA utility is still the mainstream utility function in finance for very good reasons, apart from analytical tractability. It is compatible with stable risk premia over the last 200 years, even though individuals became many times wealthier.
We group income into the wealth bucket as the present value of future savings reflects an investor’s human capital on her balance sheet. Higher levels of human capital are for most employees very bond-like (only one Robo asks for the profession as an input and the variable is not significant) and should hence increase the optimal allocation to risky assets.
Kritzman and Rich (1998) provide a taxonomy for alternative utility functions.
Lusardi et al. (2017) show that financial knowledge is a key determinant of equity market participation and Foltyn (2020) shows a positive relationship between experience and average shares in risky assets. Ampudia and Ehrmann (2014) show that the impact of experience can go either way (increase or decrease participation).
Superficially, we can label learning from past returns via Bayesian updating as experience. However, in Berk and Green (2004) investors simply learn about the ability of managers to generate positive or negative alpha from most recent realized returns. Depending on the sign of past returns, they decide to invest or not as investors need to chase promising funds before other investors do. Each additional flow dilutes alphas down towards zero. In our view, it would be highly irrational to base long-term asset allocation recommendations on personal investment biographies (across different time horizons). The right approach is to use economic state variables instead.
Suppose goal-based portfolio 1 is optimally short asset A, while portfolio 2 is long asset A. Joint optimization will lead to a partial or complete offset of these positions. Separate long-only optimizations will not.
See ESMA (2018), pp. 14–15.
See Hoechle et al. (2017) and Campbell (2016).
For financial practitioners, this is not new. The asset allocation model of Black and Litterman (1992) probably owes most of its success to a solution that is strongly anchored in a prior portfolio familiar to all investors (market portfolio).
See Venables and Ripley (2002), page 146 for a description of our methodology.
Our tree only ends up with seven final nodes (despite the existence of 11 portfolios). This is due to the use of (10-fold) cross validation to find the optimal complexity parameter (limiting the tree size). While more complex and better fitting trees can be found with changes in complexity parameters, the initial splits remain unchanged. Multi-class classification trees arrive at the same hierarchy.

References

Ampudia M. and M. Ehrmann (2014), Macroeconomic Experiences and Risk Taking of Euro Area Households, ECB Working Paper Series #1652
Bach, L., L. Calvet, and P. Sodini. 2020. Rich Pickings? Risk, Returns and Skill in the Portfolio in the Portfolios of the Wealthy, American Economic Review 110 (9): 2703–47.
Google Scholar
Beketov, M., K. Lehmann, and M. Wittke. 2018. Robo Advisors: Quantitative Methods Inside the Robots. Journal of Asset Management 19: 363–370.
Article Google Scholar
Bhattacharya, U., A. Hackethal, S. Kaesler, B. Loos, and S. Meyer. 2012. Is Unbiased Financial Advice to Retail Investors Sucient? Answers from a Large Field Study. Review of Financial Studies 25 (4): 975–1032.
Article Google Scholar
Black, F., and R. Litterman. 1992. Global Portfolio Optimization, 28–43. September/October: Financial Analysts Journal.
Google Scholar
Campbell, J.Y., and L.M. Viceira. 2002. Strategic Asset Allocation. Oxford University Press.
Book Google Scholar
Campbell, J. 2006. Household Finance. The Journal of Finance 61 (4): 1553–160.
Article Google Scholar
Campbell, J. 2016. Restoring Rational Choice: The Challenge of Consumer Financial Regulation. American Economic Review 106: 1–30.
Article Google Scholar
Cochrane J. 1999. Portfolio Advice for a Multifactor World, NBER Working Paper No. w7170, Available at SSRN: https://ssrn.com/abstract=217489
Cochrane J. 2021. Portfolios for long term investors, NBER working paper
Das, S.R., H. Markowitz, J. Scheid, and M. Statman. 2010. Portfolio Optimization with Mental Accounts. Journal of Financial and Quantitative Analysis 45 (2): 311–334.
Article Google Scholar
Deloitte .2016. Cost Income Ratios and Robo-Advisory. https://www2.deloitte.com/de/de/pages/financial-services/articles/robo-advisory-in-wealth-management.html
ESMA. 2018. Guidelines on certain aspects of the MiFID II suitability requirements, ESMA35-43-1163.
Faloon, M., and B. Scherer. 2017. Individualization of Robo-advice. Journal of Wealth Management 20: 30–36.
Article Google Scholar
Fisher, A., C. Rudin, and F. Dominici. 2018. Model Class Reliance: Variable Importance Measures for Any Machine Learning Model Class, from the ”Rashomon” perspective. arXiv preprint arXiv:1801.01489
Foerster, S., J.T. Linnainmaa, B.T. Melzer, and A. Previtero. 2017. Retail Financial Advice: Does One Size Fit All? Journal of Finance 72 (4): 1441–1482.
Article Google Scholar
Foltyn, R. 2020. Experience Based Learning, Stock Market Participation and Portfolio Choice, SSRN working paper
Gennaioli, N., A. Shleifer, and R. Vishny. 2015. Money doctors. Journal of Finance 70 (1): 91–114.
Article Google Scholar
Grealish, A. and P. Kolm. 2021. Robo Advisory: From Investing Principles and Algorithms to Future Developments. In Machine Learning in Financial Markets: A Guide to Contemporary Practice, eds. Capponi A and C. Lehalle. Cambridge: Cambridge University Press, forthcoming
Grinblatt, M., M. Keloharju, and J. Linaaunmaa. 2011. IQ and Stock Market Participation. Journal of Finance 66 (6): 2121–2164.
Article Google Scholar
Hildebrand, C., and A. Bergner. 2020. Conversational Robo advisors as surrogates of trust: onboarding experience, firm perception, and consumer financial decision making. Journal of the Academy of Marketing Sciencehttps://doi.org/10.1007/s11747-020-00753-z.
Article Google Scholar
Hoechle, D., S. Ruenzi, N. Schaub, and M.M. Schmid. 2017. The Impact of Financial Advice on Trade Performance and Behavioral Biases. Review of Finance 21: 871–910.
Article Google Scholar
Hsu, C. 2012. What Drives Equity Market Non-Participation? North American Journal of Economics and Finance 23: 86–114.
Article Google Scholar
Jacovi, A., Marasović, A., Miller, T. and Goldberg, Y. 2020. Formalizing Trust in Artificial Intelligence: Prerequisites, Causes and Goals of Human Trust in AI. arXiv preprint arXiv:2010.0748.
Kritzman, M.P., and D. Rich. 1998. Beware of Dogma: The Truth About Time Diversification. Journal of Portfolio Management 24: 66–77.
Article Google Scholar
Lusardi, A., P.C. Michaud, and O.S. Mitchell. 2017. Optimal Financial Knowledge and Wealth Inequality. Journal of Political Economy 125: 431–477.
Article Google Scholar
Merton, Robert C. 1971. Optimum Consumption and Portfolio Rules in a Continuous-Time Model. Journal of Economic Theory 3: 373–413.
Article Google Scholar
Merton R. .2017. The future of Robo advisors, CNBC video https://www.cnbc.com/2017/11/05/mit-expert-robert-merton-on-the-future-of-Robo-advisors.html.
Mullainathan, S., Noeth, M. and A. Schoar (2012), ’The market for financial advice: An audit study’, NBER Working Paper (17929).
Puhle, M. 2019. The Performance and Asset Allocation of German Robo-Advisors. Society and Economy, 41(3).
Sapienza, P., A. Toldra-Simats, and L. Zingales. 2013. Understanding Trust. The Economic Journal 123 (573): 1313–1332.
Article Google Scholar
Scherer, Bernd and Lehner, Sebastian, What Drives Robo-Advice? .2021. Available at SSRN: https://ssrn.com/abstract=3807921 or doi: https://doi.org/10.2139/ssrn.3807921
Soehnke M., J. Branke, and M. Motahari. 2020. Artificial Intelligence in Asset Management, working paper, Judge Business School.
Tertilt M. and P. Scholz. 2018. To Advise, or Not to Advice - How Robo-Advisors Evaluate the Risk Preferences of Private Investors, Journal Of Wealth Management, Fall 2018, v21, n2, 70–84.
Torno A. and S. Schildmann. 2020. What Do Robo-Advisors Recommend? An Analysis of Portfolio Structure, Performance and Risk. In Enterprise Applications, Markets and Services in the Finance Industry, eds. B. Clapham, J. and A. Koch. FinanceCom 2020. Lecture Notes in Business Information Processing, vol. 401. Springer, Cham. doi: https://doi.org/10.1007/978-3-030-64466-6_6.
Torno, A., D. R. Metzler, and V. Torno. 2021. Robo-What?, Robo-Why?, Robo-How? A Systematic Literature Review of Robo-Advice. Proceedings of the 25th Pacific Asia Conference on Information Systems (PACIS), Dubai.
Van Rooij, M., A. Lusardi, and R. Alessie. 2011. Financial literacy and stock market participation. Journal of Financial Economics 101 (2): 449–472.
Article Google Scholar
Venables, W.N., and B.D. Ripley. 2002. Modern Applied Statistics with S. New York: Springer.
Book Google Scholar

Download references

Author information

Authors and Affiliations

EDHEC Risk Institute, 393-400, Promenade des Anglais, France
Bernd Scherer & Sebastian Lehner
Schumpeter School of Business, University of Wuppertal, Wuppertal, Germany
Bernd Scherer & Sebastian Lehner

Authors

Bernd Scherer
View author publications
You can also search for this author in PubMed Google Scholar
Sebastian Lehner
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Bernd Scherer.

Ethics declarations

Conflict of interest

The material presented is for informational purposes only. The views expressed in this material are the views of the authors. The authors claim no conflicts of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix: Statistical model

This Appendix illustrates our statistical model for a specific Robo-advisor (Scalable) with 7290 input permutations. We run an OLS regression with (ordered, if applicable) factors as independent variables. The dependent variable is the recommended equity allocation. Our results are shown in table 6. The intercept of 53.483% represents the base case allocation to equities. All other regression coefficients describe the marginal effects of answering questions on the robo-advisor homepage across all 7290 choice sets. As we deal with mostly ordered factors, we can not use one hot encoding or Helmert contrasts but rather use orthogonal polynomial contrasts.^{Footnote 12} The extensions .L, .Q, .C denote coefficients from linear, quadratic and cubic regression terms.

For example, the contrast coding for experience is as ordered factor with three levels (0,1,2) where we add the number of “yes” answers with respect to product and financial service experience. Table 5 shows the corresponding contrast. A regression coefficient of 1.699 for the linear contrast on knowledge means that an investor twice ticking the box “none” receives a \(-0.7\cdot 1.699\%=-1.18\%\) (percentage points) lower equity recommendation than the base line allocation, while an investor with extensive knowledge will receive a recommendation to add \(1.18\%\) to the baseline allocation. For the full effect, we need to add the quadratic contrast or alternatively look at partial dependence plots. Responses to investment goals are easier to interpret as they are modeled as unordered factors using one-hot dummy encoding. A value of -12.349 for wealth preservation means a (ceteris paribus) decrease of 12.349% in the recommenced equity allocation for all investors ticking this box.

Table 5 Contrast matrix. We show the orthogonal polynomial contrast matrix for three levels (0,1,2)

Full size table

Our regression model explains 97% of the variance of equity allocations. The standard error of the regression (standard deviation of fitted versus actual portfolio recommendations) is 4.96, i.e 2/3 of all predictions are within +/− 4.96% difference to the true value, even though our model did not use any interaction term. However, almost the same performance can be achieved by only including risk aversion and investment goals. The explanatory power fall slightly to 95%. All other variables only account for an additional 2% in explanatory power.

Next, we want to more formally interrogate our regression model to find the most influential variable(s). For this purpose we borrow from the literature on interpretable machine learning and employ the following model agnostic algorithm suggested by Fisher et al. (2018). For each variable we randomly permute the values of that particular feature and recompute the chosen performance metric, in our case \(R_{\rm perm}^{2}\). We then record the difference between the baseline metric and the permutated metric \(R_{\rm base}^{2}-R_{\rm perm}^{2}\) as our importance score. In order to understand the added value of a given variable we look at model results when the observations for the variable under investigation are reshuffled. Shuffling an important variable will lead to a much larger drop in explanatory power than shuffling an unimportant variable. We repeat this procedure 100 times and estimate the average importance score. The results are shown in figure 1. This confirms our earlier results. Risk aversion and investment goals are the most important variables. Creating noise in these variable leads to the most severe reduction in explanatory power across all variables.

Table 6 Drivers of Robo-advice. OLS regression results of 7290 equity allocation recommendations against input choices with respect to all variable in the Robo-advisor’s questionnaire. The adjusted \(R^{2}\) of the regression is \(95.62\%\). The standard error of the regression (standard deviation of fitted versus actual portfolio recommendations) is 4.962, i.e 2/3 of all weight predictions are within ± 4.962 difference to the true value

Full size table

Finally, we check the direction of influence for each question in the Robo-advisor’s questionnaire. In order to account for nonlinear contrasts we need to compute the cumulative effect of all polynomial terms. For this purpose, we use partial dependence plot as shown in Fig. 2. The idea of partial dependence plots is to estimate a statistical model using the original data and then use this fitted model to make predictions from a modified data set. The modified data set is a complete copy of the original data set, except for the variable of interest where all realizations are replaced by a particular value. The average across all predictions is then used as best estimate for the partial variation of interest. After repeating this process for all level of the variable of interest we can plot this variable against the average responses in a scatterplot. This plot is called partial dependence plot and shows both direction and magnitude of influence.

We can confirm our results by employing a regression tree in Fig. 3. . Regression trees use explanatory variables to consecutively split the data (using only one variable at each node) into pure clusters with as little intra-cluster variation as possible. Clusters do not need to have the same size (do not need to contain the same number of variations). Instead of making continuous predictions all combinations of explanatory variables that lead into a given terminal node carry the same prediction. In our context (recommended equity allocations from questionnaire inputs, regression trees offer some advantages over linear regressions. The first split selects the most important variable, while the sequence of splits is able to model non-linearities. This allows us to find otherwise hidden nonlinear interactions. While linear regressions can also uncover nonlinear interactions by including all possible cross terms, this requires as many right-hand-side variables as data points and thus results in a loss of all degrees of freedom. Our fitted regression tree identifies the same set of variables as most important in explaining the cross section of equity recommendations. Interestingly its standard error of 1.98 is less than half of a linear regression model, which we take as evidence for nonlinear interactions not covered by a linear regression. High equity recommendations are reserved for investors with low risk aversion, agrressive goals, large levels of wealth and sufficient experience.

Our fitted regression tree regards time horizon as the first variable to split the data on. Suppose we would be only allowed to split the data once into clusters with as little internal dispersion of equity weights as possible. Our regression tree would then recommend to use the variable time horizon. In this sense time horizon is the most important variable. Short and medium term horizon investors receive recommended allocations between 21.14% (node 4) and 49.73% (node 8) equities while long horizon investors obtain portfolios between 53.6% (node 10) and 73.95% (node 13)%. All percentages are predictions from the regression. tree. The standard error of the regression tree (standard deviation of fitted versus actual portfolio recommendations) is 5.51. Five from six nodes use the variables investment goal and time horizon. This again confirms our previous analysis. Predicted equity recommendations as a function of questionnaire replies rise from left to right. The most aggressive allocations are reserved to long term investors with retirement objectives that react to losses by increasing their equity allocations. Investors with short time horizon looking to fund an upcoming expense receive small equity allocations.^{Footnote 13}

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Scherer, B., Lehner, S. Trust me, I am a Robo-advisor. J Asset Manag 24, 85–96 (2023). https://doi.org/10.1057/s41260-022-00284-y

Download citation

Revised: 22 August 2022
Accepted: 02 September 2022
Published: 29 October 2022
Issue Date: March 2023
DOI: https://doi.org/10.1057/s41260-022-00284-y

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Trust me, I am a Robo-advisor

Abstract