I'm a third-year undergraduate at The Hong Kong Polytechnic University, supervised by Prof. Bing Wang. I am interested in what representations are needed to achieve visual intelligence. To date, I have explored this question through spatial reasoning via explicit graph decomposition, and through world models in which representations emerge naturally. I hope to develop models that learn predictive representations that allow AI systems to understand and reason about the physical world.
I treat each of my projects as a work of art, and I am proud of a few of them; a full list can be found in my GitHub repo. Outside of university, I enjoy filming and video production. I once grew a channel past 100K subscribers. I have also independently produced a documentary and an AI short film, the latter of which won an award at the 15th Beijing International Film Festival (BJIFF).
I'm applying to PhD programs starting Fall 2027. I look forward to exploring questions surrounding spatial understanding in greater depth.