Some think Blockchain is just digital money. Some think Blockchain will be as transformational as the internet. And some think Blockchain is overhyped. But as time has passed, the skeptics have gotten quieter.
Blockchain technology is best known for its underlying role in enabling Bitcoin, a cryptocurrency that went from less than $0.018 per coin in 2010 to above $18,000 per coin in 2017. A blockchain itself, however, is use-case agnostic: it’s a protocol that allows decentralized trust in tracking transactions, in a sort of open-ledger system where everyone is guaranteed to have an immutable, up-to-date copy of the chain of digital “blocks” detailing all of the network actions.
In the simplest terms, a blockchain is true to its namesake: it’s a list, or chain, of groups of data records, or blocks, that grows as more and more data is added to the system. It differs from a traditional database in two key ways: (1) once a block has been added to the chain, it cannot be changed, and (2) thanks to cryptographic advancements, a blockchain can be widely distributed on any network of computers. This means everyone can see copy that they know is up-to-date and trust the information encoded, requiring no central party or single server host. More importantly, it also means that anyone can use the blockchain to interact.
Blockchain is a technology that people say will change the world. But beyond currency, nobody is yet certain on what the biggest new uses will be. All anyone knows is that to discover them, smart, hardworking people will have to do a lot of experimenting, prototyping, and educated guessing. And that’s exactly what the students of ES95r’s Blockchain section do best.
Though high potential, Blockchain technology, first created in 2008, is still relatively new to a tech world that took twenty years to warm up to the internet and another ten to finally use it to fundamentally change society. Nonetheless, Fortune 500 companies like NASDAQ and Liberty Mutual, national governments of countries like Sweden and Russia, and universities like MIT and Berkeley are already experimenting with Blockchain.
The promises of blockchain are important. They are not just financial; blockchain could provide the technology to finally solve many of the world’s biggest problems. Imagine a world with no voter fraud and no voter suppression, enabled by trustable digital identities that finally replace broken SSN-authentication dependence. Imagine a world where health records are secure yet transferring them between providers is seamless when care is needed, enabled by smart contracts and identity tokens. Imagine a world where green energy is finally all-servicing, enabled by microtransactions between distributed owners of solar panels, wind farms, and dams, outside of a utility’s monopoly. You might imagine these worlds, but others are using blockchain to create them.
The blockchain community at Harvard can hardly be called a community. While SEAS enrollment and the CS concentration size continues to rise, many SEAS students are completely unfamiliar with the technology. Some only know about Bitcoin, unaware of the powerful cryptographic ledger that enables it. Until mid-fall 2017, there was not even an unofficial student interest group.
Cue ES95r’s special blockchain section: a group of 25 Harvard students that split into teams of blockchain explorers, meeting for two hours every Wednesday to update each other on their technical or research progress and learn from each other about the space.
Though the stereotypical “Blockchain enthusiast” is a Silicon Valley-type techie, the students of ES95r drew a diverse set of Harvard students. We were philosophers and computer scientists, applied mathematicians and mechanical engineers, economists and VES concentrators. We were varsity lacrosse players and entrepreneurs, writers and music producers, designers and stand-up comedians. And we were west coast cool and east coast cold, internationals and gap-year takers, from the quad, river, and off-campus alike. If anything, the numerous interests and backgrounds of the students should illustrate that slowly but surely, everyone, in every different sector, is starting to pay attention to Blockchain.
Though ES95r drew from diverse interests, the ethnicities in the class were largely those traditionally represented in computer science, and the class was roughly 86% male - 14% female. Were Harvard to put more resources into the space, it would hopefully make it more accessible to those not traditionally in blockchain circles.
Smart Contracts that trigger based on input data from humans the two parties choose. Github Repo
We began our exploration with the intention of building a product or service not purely investment- or trading-related, to see if we could use the technology to create value that went beyond providing investors better returns. A such, we first took a wide survey of potential spaces, focusing on the system-level -- real estate, the legal system, healthcare, industrial equipment, and social.
Though we found inefficiencies in these systems, most could be solved by non-blockchain innovation, especially given the technical maturity of available blockchain tools. This led us to focus on giving others the power to use smart contracts more easily, via a sort of IFTTT interface, and seeing what they came up with, though we found difficulty in making onboarding users an easy process. Throughout, we also taught ourselves Solidity in order to build our technical prototypes.
Ultimately, we focused most on building our understanding of the technical space to allow for lower-level innovation in the future, as our exploration taught us that the biggest potential in the blockchain space still seems to be at the protocol level: creating oracles, optimized blockchains for specific use-cases, or integrations with other blockchains.
We began with research and technical exploration into the space. Those of use who took a more technical path (or who chose to do both technical and research) taught themselves solidity, while others researched current opportunity spaces. After landing on the IFTTT for smart contracts concept, we completed user research with small business owners and venmo super-users, developed a working backend prototype for triggering smart contracts based on events, and created a UI.
This space is brand new. We are still attempting to figure out some of the most fundamental problems like how to increase confirmation of transaction times and increasing throughput of transactions. It is nonsensical to build applications upon a foundation that isn’t fully fleshed out. On top of this, unlike the internet, most of the money in blockchain is reserved for the protocol level innovations. Union Square Ventures has a blog post where they talk about this in more detail.
The difficulty of getting real-world event data onto the blockchain is not that IoT devices or APIs don’t exist, but rather building an equally trustless, unhackable system to blockchain’s other operations. Some, like smartcontract.com, try to use a token-based system that rewards truth-telling, while we looked at using social circles, even between strangers, as the trust providers. Regardless of the ultimate solution, it’s clear that the oracle problem will not be solved simply with better hardware sensors: software will need to be involved.
Solidity is the coding language for Ethereum. For how new it is, there are actually a ton of resources online that will help you learn how to develop for the Ethereum blockchain. We particularly found Truffle and TestRPC most helpful when we were developing for various applications throughout the class. Regarding tutorials, we found going through all of the tutorials on the Ethereum were very helpful for basic skills. After that, we expanded to other tutorials that helped us learn to integrate Ethereum into different tools like Chrome through using the MetaMask Chrome plugin.
To continue this project, we would want to do further user research, beyond discussions and UI walkthroughs with local businesses and students. We envision a larger-scale simulation game we hold for ~20 people that puts our assumptions about social trust, and the number of trusted versus untrusted parties one needs in a trustable interaction, through extreme stress.
Logistics Network could provide API-based smart contracts for tracking packages.
Smart contracts, despite their elegance, are quite limited by the fact that the blockchain has to be deterministic, such that the result of a computation must be completely verifiable on-chain. This means that smart contracts usually can’t have bearing on the outside world without an “oracle.” I thought that a great way to expand smart contract usage would be to allow an real physical items to be traded using smart contracts.
Utilizing existing APIs for logistics networks (FedEx, Uber) and oracles to link them up to contracts in the Ethereum blockchain. Through the process, I was able to set up a rudimentary oracle on oraclize, although the actual functionality of it was limited due to costs.
I implemented a connection between live, third-party APIs and the ethereum blockchain. Since it was difficult to set up complex API queries on the blockchain using Solidity, I instead opted to set up an outside server that took a simple request and a code and sent a corresponding request to FedEx, then had oraclize send a request to this server I set up.
In the process, I learned a lot of Solidity, and the specific security concerns that developers must consider. One of the biggest challenges I faced was trying to figure out bugs in my contract code, especially since there is not much literature online about setting up oracles in a consistent way.
The next step would be to create a convenient web-app to be able to create smart contract code without learning solidity. This will allow the everyday user to create logistics contracts, which will certainly increase the number of people using smart contracts.
Building Ethereum tools to allow financial innovation. Github Repo
We began by pinpointing the characteristic advantages of the blockchain and the corresponding weaknesses of the current investment processes in emerging and frontier markets. We determined that investors fail to (or rarely) invest in these areas given the lack of data they have about the regions as well as the lack of security in capital transactions. The blockchain, we decided, provided a good mechanism for secure transactions and was a viable source of legitimate data.
Given that we had no prior expertise on smart-contract implementation as well as design best practices, the majority of the first half of the semester consisted of finding the right resources to teach ourselves how the language operated, and how the different elements of decentralized applications operated with each other. In parallel, we had to align our product strategy with current limitations and potential roadblocks to perceptions and technical difficulties involving the blockchain technology. Cryptocurrency adoption was another issue we had to design our application around so that the volatile and unpredictable nature of the currency would not be a deterrent for institutions and businesses to adopt our technology.
Throughout the rest of the semester, we began to implement different aspects of our platform, including a working currency minting and destroying smart contract (Solidity), a user interface framework (JavaScript, Web3), part of a digital signature encryption mechanism (JavaScript), as well as a piece of a financial-institution registration platform (JavaScript).
Throughout this entire process, a lot of our time was used looking for the right resources that would teach us best design and implementation practices.
We have learned a lot and hope to continue our progress in this endeavor, but would have liked some support in our technical intelligence. We, however, understand that this is a nascent field, and are excited for what Harvard will offer in the future.
An application based cryptocurrency trading algorithm and research-based set of metrics for evaluating ICOs. Github Repo
Our group completed a two-fold project: one was application-based and the other was research based.
Application-based We designed an algorithm for trading based on a Deep Recurrent Q-Network (DQN). The algorithm is designed to predict the expected levels of relative future profit (reward) from owning USD vs ETH/BTC at a given timepoint. DQNs have been used in other domains, such as playing Atari video games at beyond human levels. We have implemented a DQN framework that can take in an arbitrary neural network as a predictor and train it to perform predictions about expected reward, given arbitrary transaction fees on the exchange.
We have tested the approach using a network on toy examples to demonstrate that the approach is viable, and have since moved on to testing on hourly Bitcoin price data. So far it appears to perform reasonably, but we are continuing to test on larger amounts of data. We’ve currently been testing on periods where Bitcoin does very well, so it is difficult to fairly evaluate our model.
Research based: We developed a venture-capital type valuation system to determine what metrics make for a valuable ICO. Over the course of the term, we analyzed pricing data from top-performing ICOS and categorized them into both categorical and quantitative variables, looking at what positively correlated pricing for successful ICOs. Among the largest categorical predictors was at least one person in the group that had a significant amount of blockchain experience.
Interestingly enough, the names of many of these people were not public facing and instead sat behind pseudonyms (such as the guy behind Augur). Surprisingly, having a white-paper was not an important valuation metric since almost all the ICOs we analyzed had whitepapers. We found that the more technically complex ideas generally tended to do well, but this association was weak.
In the “Analysis” excel document, the second “Highest growth ICOs”, includes a risk analysis of how risky each investment is. The program took into account the presence of a whitepaper, how long it had been since the ICO date, average growth percentages and had adjusted for the rise in the crypto market as a whole, basically trying to see how the coins did when they only had to go up against themselves.
We also found that the Chinese government’s ban on ICOs had an overall negative effect on the overall amount of funding that ICOs received. For ICOs that were released before the ban, there was an overall uptick in month-by-month investment. Basically, if a project decided to ICO before the China ban date, they’d see an average of 2% more capital growth every month, but after the ban there was overall less investment in ICOs. There could be other confounding variables, but on a baseline this suggests that China is more bullish on ICOs compared to the rest of the world. The raw data for this that we based analysis on is included in the crypto_prices csv file, if interested.
In regards to qualitative data, we found that successful ICOs (up until this point) have had three relevant price spikes on average - the first happens right at the point of release, which is where the first valuation happens. There is no average time-period for the second and third price spikes, they can be anywhere between months to an entire year. However, we found that some of these price spikes are correlated to dates with significant shifts in the cryptocurrency market (such as the dates when BTC jumps 24% in one day) and thus gave us inferential errors when looking at the fluctuations of market capitalization for a given ICO and the price.
Our current neural network predictor is an LSTM that uses historical price data as the input features. Since we have our general DQN architecture built, we next plan to expand on this predictor to take in more features that might be helpful in predicting expected reward. Our current predictor performs best on hourly data, so we are also interested in modifying this predictor to better handle the volatility of 10 minutely price data.
Algorithmically trading cryptocurrencies.
We looked at different approaches to algo-trading cryptocurrencies. The first stage was to get data: we coded and ran a data-logging server on the EC2 cloud which logged the minutely order book data across 17 different crypto exchanges. Next, we started an initial survey of differences between exchanges. Key findings included significantly different transaction fees (ranging from -0.1% to 1%), trading in different real currencies (USD, ZAR, CNY), differences in volatility and trading volume, and significant differences in the valuation of BTC based on exchange rates (for example, some exchanges traded BTC 10% higher than other exchanges – arbitrage trading this away is challenging, because capital needs to be moved between different real currencies, and deposited into a different bank account, for each trade).
We began a preliminary analysis of naive trading techniques: using nearest neighbors and linear regression to create estimators for future price changes, and trading accordingly. We found that transaction fees can make a significant difference to the level of profitability of high frequency trading algorithms.
Next we created a live paper trader to test these algorithms in real time – we found that the real time trading performance (vs performance on historical data) was much lower: this was because of small assumptions made when running the algo trader on historical data (for example, not taking into account that the bid price does not equal the ask price on the exchange).
After this, we started to look at more sophisticated time series models, including VAR models and the use of recurrent neural networks. While these models would give positive return, these tests were done during the time the BTC was increasing rapidly, so it was difficult to evaluate the success of these trading algorithms relative to a realistic baseline.
We also took a slight digression to look at how interconnected different exchange prices were. We did this by building a few statistical models and fitting them to the orderbook data we collected from our datalogger. In general, we found strong qualitative relationships that Biter and Bithumb strongly influence the prices on other cryptocurrency exchanges in the short term (2 minute intervals). We also saw evidence that on Biter and Bithumb, the price movement, rather than the full orderbook, was most influential in impacting price movements on other exchanges in the short term.
A key learning experience from this is how important it is to get small details in the trading model for high frequency trading correct – otherwise the performance forecasts can be highly inaccurate. For example, assuming the historical price was the “price” affected our results when real-time trading, because we did not have the historical information of the orderbook’s bid and ask prices.
We want to continue testing and refining our algorithm, especially during a period when bitcoin is not increasing rapidly. Otherwise, it’s difficult to separate the effectiveness of our algorithm against the overall increase of the market, and it often also makes the most sense to just hold rather than actively trade currency during those price surges.
An education platform for beginner cryptocurrency investors. Project Website
We started by identifying our group’s goal – we were all interested in investing and wanted to learn more about how to best invest our funds and where we can find more information about investing. We began by mapping out the most popular cryptoasset investments and analyzing their growth, value proposition, application, and tentative futures to see if there were any patterns amongst the cryptoassets that had grown to be the most popular.
As we moved deeper into the cryptoasset analysis, we decided to form an investment group to map out how well our newly-gotten information and instincts would fare in the cryptoasset investment market. However, upon midterm check-ins, we realized that this approach didn’t leave much lasting impact - we’d be the only ones to benefit from this type of approach. We decided to pivot to addressing a key point that had consistently come up during our research - it’s very difficult to cement the fundamentals of blockchain and set up an investing flow, so we decided to build an education platform that helped with exactly that.
We built a website, written for the complete novice, about how to invest in cryptocurrency, with the goal of making it a more accessible investing space.
Given the recent hype around this market, hopeful investors may dive in and throw money at market in hopes of making a positive return. However, this is a terrible (and sometimes financially-ruining) investment strategy, and as a result making sure people who visit our site are strongly encouraged to follow good investment practices. Without these practices, it doesn’t matter how good or bad a cryptocurrency is - an investor is going to end up with a net loss.
There’s a lot of work behind the setup to invest in the first place. You have to know what you want to invest in, as different exchanges have different topologies. There are fees and costs on all platforms - knowing when to transfer and where to purchase/sell cryptoassets is currently an art of its own. Additionally, concepts such as public and private keys, and the greater concept of setting up a wallet, as well as how to truly protect sensitive information (saving your private key in OneNote isn’t secure…) are generally new concepts to people who have no interaction with blockchain outside of their interest in investing. Addressing these needs is an important part of our investment platform.
Understanding exchange topology is an extremely important part of being a smart investor. Each exchange has its strengths, weaknesses, fees, and limitations, and knowing where to do business, when to transfer business, and how and where to make a sale all require an understanding of the major exchanges and their properties. This is an especially difficult area to tackle because these options and limitations change both based on the location from which an investor is operating and the regulations in that area regarding cryptoassets.
Our group performed analyses of a large number of assets, which we tackled according to their investment popularity, as a part of our efforts to create an investment strategy before we pivoted towards an education platform. We quickly realized the vast majority of these “investments” weren’t meant to be currency in any way, shape or form. They were made for communication, computation, or otherwise resource-exchange-based phenomena when people were attempting to spectate on them like guessing which currency will gain popularity. While there is meaning in value - after all, if any one of these networks are utilized for their purpose, the tokens will increase in value as demand for them increases - the way the assets are prospected and understood aren’t always correct. Understanding the purpose of an asset and how to value it is an important part of entering the cryptoasset investment market.
We’d like to see more acknowledgement of the potential of blockchain (and its evolutions like Iota’s Tangle) for future technology. While blockchain is not an universal solution to every existing technology problem out there, it is very useful in democratizing and decentralizing many existing industries, and thus inherently disruptive to any industry that it can be applied to. Blockchain will definitely play a big role in the future of the digital age. We missed the mark with the boom of the internet - let’s not repeat mistakes.
We want to continue testing and refining our algorithm, especially during a period when bitcoin is not increasing rapidly. Otherwise, it’s difficult to separate the effectiveness of our algorithm against the overall increase of the market, and it often also makes the most sense to just hold rather than actively trade currency during those price surges.
Creating a better cryptoasset valuation system.Project Website
Our group began this semester with some exposure to blockchain and cryptocurrency but little formal technical training in the space. Initially, we sought to focus on smart contracts and applications; in this vein, we set out to learn smart contract programming with Solidity on the Ethereum blockchain. We quickly realized that we would be more interested in studying the space at a higher level and developing a better understanding of the crypto landscape in terms of coin valuations and typologies. This motivation was in part spurred by the guest speaker and current HBS student Homan Mohammadi, who has spent the last few years studying the space more in depth.
After a few meetings, we began working closer with Homan on a few key areas: 1) establishing a more fundamental approach to cryptoasset valuations, and 2) developing a typology for the different kinds of coins being made available. There is immense opportunity in applying traditional economic and financial thinking to the cryptoasset space, and the dearth of research in this nascent field makes the opportunity relatively accessible.
We focused on building a better metric, by evaluating a metric called NVT (Network Value to Transactions Ratio) that is analogous to the PE ratio for a stock. The PE ratio represents a ratio of price to “health” of a given financial instrument, where health for stocks is calculated as the company’s earnings. For a cryptoasset like bitcoin, “health” can be thought of as network transaction volume. If people are using Bitcoin for more of their transactions, this justifies a higher price.
The fundamental value of a cryptoasset like Bitcoin can be thought of as either a medium of exchange or a store of value. Some of the ways in which we considered refining the NVT metric were: using circulated supply vs total supply of coins; and changing how we represented transactions (to avoid overcounting in cases where the transaction was represented as going to two addresses).
Traditional finance theory has relevance to the world of cryptoassets. We were able to leverage research in stock valuation (like the idea of the PE ratio) and portfolio rebalancing (the idea of dynamic resizing) when completing our project. One main metric we examined in this regard was NVT. A high NVT indicates that a cryptoasset's on-chain transaction volume is valued more highly. This generally means that the network is experiencing rapid growth and its future returns are valued more highly by the market, or that there is a potential bubble in price if this increased valuation is unsupported. Our project partially tried to better separate these situations through better analysis of transaction activity and fundamental drivers. NVT is calculated by the network value (total market capitalization) divided by the transaction volume (in dollars) on the blockchain. The transaction volume is only an estimate derived from activity seen on blockchains.
There is currently no clear consensus on exactly how cryptoassets differ from other kinds of assets. Over the course of the semester, we read many different kinds of blogs and books on cryptoasset valuation, and found lots of diversity in how the crypto community is thinking about these problems. Since it is such a nascent space, there is opportunity for students to be part of shaping the theory of valuation, even through data collection or analysis. There is solid research establishing cryptocurrencies as a separate asset class from existing classes, but within this specific asset class there are also distinctions to be made.
Cryptocurrencies are the most similar to existing currencies/commodities and can act as either a store of value or medium of exchange. Though many subdistinctions can be made within this group, such currencies might include Bitcoin, Ether, Litecoin, or Monero.
Cryptoutilities comprise tokens and other applications which provide a service and for which valuations are based off the value of the future network. Such utilities might include Augur, Gnosis, Siacoin, or Golem.
Finally, cryptosecurities are a class of coins that essentially provide liquidity and transparency to an existing asset or asset class. Such securities might be venture capital or real estate investment funds, where value is driven by underlying assets. These behave like traditional securities, albeit with non-traditional fundraising mechanisms.
We see considerable opportunity for Harvard academics to weigh in on some of the valuation techniques and economic implications of the technology. As discussed, the emerging nature of this field of research provides an opportunity for newcomers to quickly get up to speed and to bring along interdisciplinary insights that can positively affect the space.
The things that most everyone in the course found to be true.
The number of resources that explain or allow development on blockchains has exploded, and with them, the number of people interested in or working in the blockchain space. It is no longer unreasonable to have multiple annual conferences about blockchain technologies, academic offerings, or research divisions in the space.
Though not always the earliest adopters of technologies not invented in university, the nation’s (second) best schools are offering undergraduate courses and blockchain conferences, and publishing blockchain research. While taking ES95r, students would travel to MIT or Berkeley for conferences and meetups.
In most computer science topics, students need only learn the technology, as they enter with an innate understanding of the concept: one may learn JavaScript, but already be innately familiar with the web; one can learn about cryptography, but already understand why one needs password-secured information; one can even learn the basic algorithms, but intuitively why searching information efficiently matters. Blockchain is at least as important as these other topics, yet because it is slightly removed –– it depends on cryptography, it depends on algorithms, and it depends on the internet networks —— one must learn both to build with and what can be built.
Current blockchain technology provides an opportunity for many low-level innovations –– faster transfer protocols, data integrations, identity systems –– but to build them, one needs high-level innovation –– conceptual models, evaluative metrics, and new ways of thinking about networks. We found this to be the case especially in non-investment opportunities: one must understand how a social or transport network functions from above, but also work to create the technology that allows that function.
While blockchain is new, analogous concepts do exist; the trick is separating “what is” from “what seems to be.” When applying investment strategies, brainstorming product markets, or building a new protocol, one can look to past successes, but one must be careful to test and prove that such past innovations are actually applicable.
Blockchains are based on principles of decentralization, which contributes a large part of the technology’s potential power, but it also means that information about specific blockchains, protocols, or products can be difficult to find. We found that clubs, academic courses, and conferences were the best means by which to learn the most, as someone else almost always had useful thoughts to contribute when problems or questions arose.
At worst, blockchain will replace many of today’s financial systems and remain the technology behind hundreds of billions of dollars of digital transactions. At best, Blockchain will transform society with an impact comparable to the internet revolution. In either case, it’s a technology that should not and cannot be ignored.
The students of ES95r’s inaugural Blockchain section created many things: new investment strategies and metrics, attempts at solving the hard oracle problem, and frameworks for thinking about the products of this new technology. We learned about cryptography, digital identity verification, network flows, and distributed systems, from the internet, guest speakers, and each other.
Although Paul Bottino’s guidance was helpful, we otherwise had limited support: no undergraduate blockchain community, no course curriculum, and no dedicate educational resources. ES95r:Blockchain was a classic example of Harvard students being Harvard students, determined to not let lack of established resources stop us from learning what we wanted to learn. The excitement from other industry and academic institutions, combined with the hard-working brainpower always present on Harvard’s campus, makes one believe that Harvard has massive untapped ability to advance blockchain technology and solve problems from identity to healthcare to law to energy.
One can only imagine just how innovative Harvard could be in the blockchain space, if it simply tried.
(searches for "Blockchain" over 2012-2017; source: Google Trends)
© Shuya Gong, Joel Kwartler, and the students of Fall 2017's ES95r:Blockchains