Real Estate
Zillow Launches $1 Million Zestimate Competition for Data Scientists
Zillow publishes Zestimates for 110 million Ì첩ÈüʹÙÍøs; Participants in Kaggle-hosted contest will compete for $1.2 million in total prizes, the largest prize pool ever awarded in a machine learning competition
Zillow, the leading real estate information and Ì첩ÈüʹÙÍø-related marketplace, has launched Zillow® Prize, an award of $1 million to the first person or team who can most improve the Zestimate® algorithm, Zillow’s proprietary Ì첩ÈüʹÙÍø valuation tool. Zillow Prize calls for data scientists, engineers and visionaries to compete to improve automated Ì첩ÈüʹÙÍø valuations of 110 million Ì첩ÈüʹÙÍøs across the U.S.
Designed to be a starting point to help people estimate the value of a Ì첩ÈüʹÙÍø when it launched in 2006, the Zestimate Ì첩ÈüʹÙÍø valuation marked the first time Ì첩ÈüʹÙÍøowners had instant access to information about their Ì첩ÈüʹÙÍøs’ estimated values, for free. Prior to the Zestimate, only appraisers, mortgages lenders and professional real estate agents had access to computer valuations of Ì첩ÈüʹÙÍøs â€� never Ì첩ÈüʹÙÍøowners or prospective buyers.
To Ì첩ÈüʹÙÍøowners, sellers and buyers, the Zestimate Ì첩ÈüʹÙÍø valuation remains an important data point. Combined with other information, like recent Ì첩ÈüʹÙÍø sales, and the guidance of real estate professionals, the Zestimate helps consumers make smarter financial decisions about their Ì첩ÈüʹÙÍøs.
To data scientists, the Zestimate Ì첩ÈüʹÙÍø valuation is known as the ultimate algorithm, one of the highest-profile, most accurate and sophisticated examples of machine learning. Zillow Prize will mark the first time that a portion of the proprietary data that powers the Zestimate Ì첩ÈüʹÙÍø valuation will be available to individuals outside of Zillow.
Zillow’s data science team continually works to improve the accuracy of the Zestimate Ì첩ÈüʹÙÍø valuation, as measured by how close the Zestimate is to the eventual sale price of a Ì첩ÈüʹÙÍø. The U.S. median absolute percent error currently stands at 5 percent, improved from 14 percent in 2006.
“We still spend enormous resources on improving the Zestimate, and are proud that with advancements in machine learning and cloud computing, we’ve brought the error rate down to 5 percent nationwide,” said Stan Humphries, creator of the Zestimate Ì첩ÈüʹÙÍø valuation and Zillow Group chief analytics officer. “While that error rate is incredibly low, we know the next round of innovation will come from imaginative solutions involving everything from deep learning to hyperlocal data sets — the type of work perfect for crowdsourcing within a competitive environment.”
The contest is being administered by Kaggle, a platform designed to connect data scientists with complex machine learning problems. It will be staggered into two rounds, the public qualifying round which opens today and concludes Jan. 17, 2018i and a private final round that kicks off Feb. 1, 2018 and ends Jan. 15, 2019ii.
Contest participants have until Oct. 16, 2017 to register for the qualifying round, download and explore the competition data setiii, and develop a model to improve the Zestimate residual error. The top 100 teams from the qualifying round, those whose solutions most reduce the difference between the Zestimate Ì첩ÈüʹÙÍø valuation and the actual sale price of the Ì첩ÈüʹÙÍøs within the dataset, will be invited to participate in the final round and compete for the $1 million dollar prize.
In the final round, the winning team must build an algorithm to predict the actual sale price itself, using innovative data sources to engineer new features that will give the model an edge over other competitors. The Ì첩ÈüʹÙÍø value predictions from each algorithm submission will be evaluated against real-time Ì첩ÈüʹÙÍø sales in August through October 2018. To take Ì첩ÈüʹÙÍø the $1 million dollar grand prize, the winning algorithm must beat Zillow’s benchmark accuracy on the final round competition data setiv and enhance the accuracy further than any other competitor. A $100,000 second place prize and $50,000 third place prize will also be awarded in the final round. A total of $50,000 will also be awarded to the top three ranking teams in the qualifying round.
Zillow publishes Zestimates on more than 110 million Ì첩ÈüʹÙÍøs across the country based on 7.5 million statistical and machine learning models that examine hundreds of data points on each individual Ì첩ÈüʹÙÍø.
To calculate the Zestimate Ì첩ÈüʹÙÍø valuation, Zillow uses data from county and tax assessor records, and direct feeds from hundreds of multiple listing services and brokerages. Additionally, Ì첩ÈüʹÙÍøowners have the ability to update facts about their Ì첩ÈüʹÙÍøs and see an immediate change to their Zestimate. More than 70 million Ì첩ÈüʹÙÍøs on Zillow have been updated by the community of users.
Zillow
Zillow® is the leading real estate and rental marketplace dedicated to empowering consumers with data, inspiration and knowledge around the place they call Ì첩ÈüʹÙÍø, and connecting them with the best local professionals who can help. Zillow serves the full lifecycle of owning and living in a Ì첩ÈüʹÙÍø: buying, selling, renting, financing, remodeling and more. In addition to Zillow.com®, Zillow operates the most popular suite of mobile real estate apps, with more than two dozen apps across all major platforms. Launched in 2006, Zillow is owned and operated by Zillow Group (NASDAQ:Z and ZG) and headquartered in Seattle.