tanchao's blog

About Me

My name is Chao Tan. I build platforms that are reliable and performant. I enjoy art, food, basketball, swimming and skiing.

I'm currently an engineer at Snowflake in Bellevue, where I work on data privacy and governance. Before that, I worked on Amazon Alexa, owning APIs, services and 3P orchestration.

Links

@chaostan on Twitter

2024-03 retro

Monthly retro over:

Delivery

* classification UDF -> UDTF is done-done, a 10x improvement.
* automatic tag propagation engine design is almost done. I'm satisfied with the final algorithms for performance and eventual consistency, and with the full-scan trade-offs we thought through, which eventually came down to time vs. memory.

Next priority

* implementation of the continuous propagation
* observability of the propagation
* lots of data analysis and SQLs over the dependency graph and tag associations, building a new dashboard for object tagging

Learning from work and study

* performance improvement at the architecture level: focus on gains that reduce the order of growth, e.g. from O(N) to O(log N)
* how to write better docs
* form an opinion first, then focus on all the supporting data; if there isn't enough supporting data, revise the opinion
* the CEO change had a huge impact on $SNOW, as well as on my financial plan
* an admired architect left the team; they taught me a lot, even with their last stamp
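
To make the O(N) to O(log N) point above concrete, here is a minimal sketch that is not taken from my work code. It assumes a sorted in-memory list, and the function names `linear_search` and `binary_search` are made up for illustration. It counts comparisons so the difference in growth is visible when N doubles.

```python
def linear_search(items, target):
    """Scan left to right. Returns (found, comparisons). Worst case O(N)."""
    comparisons = 0
    for item in items:
        comparisons += 1
        if item == target:
            return True, comparisons
    return False, comparisons


def binary_search(sorted_items, target):
    """Halve the search range each step. Returns (found, comparisons). O(log N)."""
    comparisons = 0
    lo, hi = 0, len(sorted_items)
    while lo < hi:
        mid = (lo + hi) // 2
        comparisons += 1
        if sorted_items[mid] == target:
            return True, comparisons
        if sorted_items[mid] < target:
            lo = mid + 1  # target can only be in the right half
        else:
            hi = mid      # target can only be in the left half
    return False, comparisons


if __name__ == "__main__":
    # Hypothetical sizes; the target is absent, so the linear scan hits its worst case.
    for n in (1_000, 2_000, 4_000):
        data = list(range(n))
        _, linear_count = linear_search(data, n)
        _, binary_count = binary_search(data, n)
        print(f"N={n}: linear={linear_count}, binary={binary_count}")
```

Each time N doubles, the linear scan pays for N more comparisons while the binary search pays for about one more. That is the kind of gain I mean: it changes how cost grows, not just the constant.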

Look around industry findings

* DBRX is not an efficient move; we chose Mistral for a reason: price/cost efficiency.
* AI Component System is a good paper to read. I think the LLM is like the OS of the future world; more importantly, wait and see who will invent its DOS and its Windows.
