Python for Finance: Robo Advisor Edition |

Python for Finance: Robo Advisor EditionExtending Stock Portfolio Analyses and Dash by Plotly to track Robo Advisor-like Portfolios.

Kevin BollerBlockedUnblockFollowFollowingMay 1Photo by Aditya Vyas on Unsplash.

Part 3 of Leveraging Python for Stock Portfolio Analyses.

Introduction.

This post is the third installment in my series on leveraging Python for finance, specifically stock portfolio analyses.

In part 1, I reviewed a Jupyter notebook with all of the code needed to extract financial time series data from the Yahoo Finance API and create a rich dataframe for analyzing portfolio performance across individual tickers.

The code also included a review of some key portfolio metrics with several visualizations created using the Plotly library.

In part 2, I extended Part 1’s analyses and visualizations by providing the code needed to take the data sets generated and visualize them in a Dash by Plotly (Dash) web app.

In this series continuation, I will provide an overview of Robo Advisors and then share additional code and details on how to evaluate a diversified index strategy.

This strategy can be used for several personal finance use cases, including as part of a holistic approach that combines ETFs with individual stocks and bonds.

It could also be used to evaluate the efficacy of a Robo Advisor alongside a personally managed ETF strategy.

Finally, one of the largest limitations from my initial approach was that the analyses did not account for dividends and compare total shareholder return.

Total shareholder return is now incorporated — in my view, this is one of the largest gaps I’ve seen in retail investor personal portfolio apps.

It is extremely difficult to get an all-up view of portfolio performance, across investment timings and including earned dividends.

My approach now accounts for both, which was a personal pain point that led me to solve this with my own product.

I will continue to evolve this portfolio performance web app; and I’ll share future updates to see if there is a broader market for this approach as a consumer-facing app.

Disclosure: Nothing in this post should be considered investment advice.

Past performance is not necessarily indicative of future returns.

I am writing about generalized examples and show how to import data using pandas for a model portfolio.

You should direct all investment related questions that you have to your financial advisor and perform your own due diligence on any investments mentioned in this post.

Therefore, I assume no liability for any losses that may be sustained by the use of the method described in this post, and any such liability is hereby expressly disclaimed.

Overview of Robo Advisors.

Per NerdWallet, a robo-advisor is “an online, automated portfolio management service”.

Robo Advisors use algorithms, which are based on answers that the individual user inputs and drive investment selection for the individual based on her risk tolerance and investment horizon (aka, time to retirement and/or financial objectives).

Robo Advisors offer much lower costs than traditional human financial advisory, with the tradeoff being that you generally do not have anyone to personally consult with regarding your financial objectives.

Robo Advisors are typically best suited for passive investors, who are comfortable with someone else building and optimizing a personal portfolio, and who also do not have complex financial situations.

Some of the major and most well-known Robo Advisors include Wealthfront, SoFi and Betterment.

Personal Capital is another option in this space, although the company does not believe it should be classified as a Robo Advisor — this is because that Personal Capital combines a sophisticated budgeting and portfolio monitoring application with virtual human financial advisors.

Personal Capital markets itself as a financial technology platform that can also advise higher net worth individuals and families who have more complicated financial situations.

In this post, we will leverage a diversified ETF example that Personal Capital provides on its Wealth Management performance page.

I believe this is informative because I respect Personal Capital’s approach and they’re very transparent with their performance relative to benchmarks.

Passive versus Active Investment Strategies.

Over the years since I initially started investing, I’ve focused more and more on making my investing strategy as boring as possible.

This means that I’ve subscribed to more of a passive investment strategy and I acknowledge that attempting to beat the market is extremely difficult.

In Part 1 of this series, I noted that over the long-term 1 in 20 actively managed domestic funds beat index funds (link).

Outperformance over a prolonged period is very difficult to maintain and previous outperformers tend to revert to the mean of benchmark performance over the long term.

Further emphasizing this point, it was recently announced that, for the 9th year in a row, active fund managers trailed the S&P 500.

Active managers, who previously claimed they would do better during periods of increased volatility, will have to go back to the drawing board once again.

Given this, part of my focus is in a diversified ETF strategy — this employs high quality ETFs that have a very low-cost structure and provide diversification across asset classes.

While I prefer my investment strategy to be as boring as possible, I continue to also invest in individual stocks that meet several investment criteria, including accelerating revenue growth, earnings outperformance and, ideally, the development of products that I personally love.

Part 1 and Part 2 of this series cover in detail how to track individual stock performance relative to the S&P 500, which is similar to how we’ll evaluate a diversified ETF strategy in this post.

Part 3’s Code Implementation.

Setup.

Similar to Parts 1 and 2, I created a repo on GitHub with all files and code required to create the final Dash dashboard.

For this post, I will explain the following aspects of the code:Target allocation for ETFs relative to current allocation.

Adding earned dividends to each position, and the benchmark, in order to calculate total shareholder return relative to a benchmark.

Evaluating total shareholder return for the model portfolio relative to a single benchmark, in this case the S&P 500.

The Jupyter notebook in the repo for this post has all the code needed from start to finish, but if you would like the full explanation on generating the portfolio data set, please refer to part 1.

If you would like more detail on working with Anaconda, virtual environments, and working with Dash, please see the Getting Started section in part 2.

As discussed at the end of Part 2, the limitations to the previous approach were i) it did not account for dividends, ii) it evaluated active positions and did not include previously divested ones, and iii) there were opportunities to automate the overall process by generating data pipelines that could feed into a live web application.

In my view, not including dividends and being unable to evaluate total shareholder return (TSR) were the largest gaps; and the updated approach in this post evaluates TSR.

I’m less concerned with divested positions, because this evaluation is most useful for evaluating how well your strategy is performing and if there are positions you continue to hold that you probably should not, e.

, lagging benchmark and therefore representing both overall performance drag and opportunity cost in not holding a better investment.

While I would like to fully automate this process, I’ve de-prioritized that in favor of refining my overall strategy.

If I do decide to pursue a fully automated approach, I’ll decide if it merits a separate post.

Target Allocation.

For the code discussion, we’ll review parts of the Jupyter notebook — an interactive version can be found here.

If you would like details on the dataframe development and closing high evaluation, please review part 1.

Below I’ll highlight the primary addition to the dataframe development, which is understanding and comparing target allocation versus current allocation.

In our model portfolio, we would like to have 50% of our investments allocated to VTI, which is a total stock index ETF for US equities.

As prices change post investment, our allocation will shift away from 50% based on the movement of this asset, as well as the other assets in our model portfolio.

For that reason, we should monitor this movement, e.

, quarterly, and adjust the assets held in order to get back to our target allocation.

For example, if VTI increases above 50% and VEU dips below 25%, then we should sell down some of VTI and purchase more of VEU to get back to our targets (assuming you want your model portfolio’s overall investment to remain neutral, which is similar to robo-advisor strategies).

The below starts on line 32 and stops on line 36 in the notebook.

# This dataframe will only look at the positions with an intended contribution, if applicable.

merged_portfolio_sp_latest_YTD_sp_contr = merged_portfolio_sp_latest_YTD_sp[merged_portfolio_sp_latest_YTD_sp['Target_Alloc']>0]merged_portfolio_sp_latest_YTD_sp_contr# Shorten dataframe to focus on columns that will be used to look at intended versus current allocations.

merged_portfolio_sp_latest_YTD_sp_contr_subset = merged_portfolio_sp_latest_YTD_sp_contr[['Ticker', 'Target_Alloc', 'Cost Basis', 'Ticker Share Value']]merged_portfolio_sp_latest_YTD_sp_contr_subset# If you've bought multiple positions at different times, this pivot table will aggregate the sums for each position.

merged_portfolio_sp_latest_YTD_sp_contr_subset_pivot = merged_portfolio_sp_latest_YTD_sp_contr_subset.

pivot_table( index=['Ticker', 'Target_Alloc'], values='Ticker Share Value', aggfunc='sum')merged_portfolio_sp_latest_YTD_sp_contr_subset_pivot.

reset_index(inplace=True)merged_portfolio_sp_latest_YTD_sp_contr_subset_pivot# These new columns calculate the actual allocation to compare to the target allocation.

merged_portfolio_sp_latest_YTD_sp_contr_subset_pivot['Total'] = merged_portfolio_sp_latest_YTD_sp_contr_subset_pivot.

loc[:, 'Ticker Share Value'].

cumsum()merged_portfolio_sp_latest_YTD_sp_contr_subset_pivot['Allocation'] = merged_portfolio_sp_latest_YTD_sp_contr_subset_pivot['Ticker Share Value'] / merged_portfolio_sp_latest_YTD_sp_contr_subset_pivot.

iloc[-1, -1]merged_portfolio_sp_latest_YTD_sp_contr_subset_pivotmerged_portfolio_sp_latest_YTD_sp_contr_subset_pivot.

sort_values(by='Target_Alloc', ascending=False, inplace=True)merged_portfolio_sp_latest_YTD_sp_contr_subset_pivotIn the file we read in, we look at all positions that we noted with a Target Allocation; this provides flexibility in the event you want to track a position’s performance but are not initially investing in it.

We subset the dataframe and then use the pivot_table method to aggregate total market value for each holding.

As noted in the code, we run this pivot because, over several years, you’ll likely invest in the positions in this model portfolio numerous times — this pivot lets you look at your all-up allocation given current market prices for each.

We then sum up each position’s total market value (Ticker Share Value) and divide it by the total market value of the overall portfolio; this allows you to compare current allocation, based on market price changes, relative to your initial target allocations.

Total Shareholder Return.

Above line 54 in the notebook, you’ll see a Dividends section.

Here I’ve gathered dividend data for each position, and you’ll see in the notebook the URLs for the sites where you can gather dividend data.

I’ll be self-critical here and admit that this section of the code has some room for improvement.

I’ve investigated accessing Quandl’s API to automate importing dividend data, but I decided for now that it was not worth paying for a subscription (dividend data is pay walled).

I’m also looking into developing a scraper for these sites; but for now, I’m copying/pasting data from the sites into the Historical Dividends excel file and manually cleaning it up a bit.

This is generally fine as dividends tend to be paid quarterly and the model portfolio only has 7 total positions.

I definitely welcome any suggestions on how to improve this section of the code.

Note, however you decide to aggregate dividend data, you should make sure that you only account for dividends where your acquisition date of the position is less than or equal to the Ex-Div Date (see function, from line 60, shown as the first code block below).

# This function determines if you owned the stock and were eligible to be paid the dividend.

def dividend_post_acquisition(df): if df['Ex-Div.

Date'] > df['Acquisition Date'] and df['Ex-Div.

Date'] <= stocks_end: val = 1 elif df['Ex-Div.

Date'] <= df['Acquisition Date']: val = 0 else: val = 0 return val# subset the df with the needed columns that includes total dividends received for each position.

merged_subsets_eligible_total = merged_subsets_eligible.

pivot_table(index=['Ticker #', 'Ticker', 'Acquisition Date', 'Quantity' , 'Latest Date', 'Equiv SP Shares'] , values='Dividend Amt', aggfunc=sum)merged_subsets_eligible_total.

reset_index(inplace=True)merged_subsets_eligible_totalOnce you have created the eligible dividends dataframe, based on the ex-div dates for each stock since you’ve held it, the above code aggregates the total dividends received for each position and also includes the Equiv SP Shares column; this is needed to compare each position’s total dividends received relative to TSR an equivalent SP500 investment would have returned.

# This df adds all of the columns with the SP500 dividends paid during the range that each stock position was held.

# For comparative holding period purposes.

dividend_df_sp500_2 = dividend_df_sp500agg0_start_date = datetime.

datetime(2014, 4, 21)dbc1_start_date = datetime.

datetime(2014, 4, 21)igov3_start_date = datetime.

datetime(2014, 4, 21)veu4_start_date = datetime.

datetime(2014, 4, 21)vnq5_start_date = datetime.

datetime(2014, 4, 21)vti6_start_date = datetime.

datetime(2014, 4, 21)vti7_start_date = datetime.

datetime(2014, 5, 5)dividend_df_sp500_2.

loc[:, 'agg0_sum'] = dividend_df_sp500_2[(dividend_df_sp500_2['Ex-Div.

Date'] > agg0_start_date) & (dividend_df_sp500_2['Ex-Div.

Date'] <= stocks_end)].