Authors: Duy Ho, Bhuvan Chennoju
“Metaverse”, a new form of social life via virtual reality, is a new ambitious vision set by Meta (formerly known as Facebook) that allows users to socialize in a virtual world. It is a groundbreaking concept in social networking in which we will perform many activities such as working, playing, studying, and interacting with each other in an immersive way. Thus, our project, MetaLens, plans to focus primarily on a vision in the future where daily lives are enhanced with technology and Artificial Intelligence (AI). Our lives will be surrounded by robots, drones, and other automatic systems that make our tasks a lot more convenient. Via AI, Deep Learning, Sentiment Analysis, and Object Recognition, MetaLens aims to help not only the citizens but also the government be aware of their citizen’s well-being and the city’s overall performance to derive appropriate solutions.
- Leading platform for 2D and 3D game and application development
- Realistic physics mechanism and interaction (collision, gravity, speed, acceleration, …)
- High-quality rendering and graphics
- Cross-platform compatibility (mobile, android, PC, web, HoloLens, Oculus Quest, Oculus Rift, Vive, SteamVR, and Valve)
- Abundant assets (standard, free, and paid) from Unity and other third parties
- Strong community support and maintenance
- Python: best framework for data science and deep learning
- Keras: one of the most popular frameworks for building DL models
- Jupyter Notebook: interactive tool to work and share Python code efficiently
- Ngrok: Convenient to quickly and effectively establish connection to front end.
- Detectron2: object detection and segmentation library with pretrained models ready for transfer learning and finetuning.
https://susanqq.github.io/UTKFace/ ~ 20,000 face images with gender, age, and ethnicity
Google Street View API https://developers.google.com/maps/documentation/streetview/overview ~ Houses, Utility Poles + Street Lights
COCO Dataset (80 objects)
https://cocodataset.org Vehicles
Road Damage Dataset
https://github.com/sekilab/RoadDamageDetector Road
UTK has 2 versions:
- Cropped + Grayscaled
- Uncropped + Colored
- Data simplification
- Data Extraction
- Data Structuring
- Close-up Face version (limited vision cues)
- Simple < 10-layer CNN
- Low accuracy for each category (~50-60%)
- Colored Portrait version (more hints about clothes, skin tone, … )
- Transfer Learning with ResNet-50 pretrained on ImageNet and Early Stopping
- Increased accuracy to 70-90%
- Input: Image
- Output: Age, Gender, Ethnicity, and Sentiment (Emotion)
Unity: PC version, Web, and VR
{
"Age": "Middle-Aged",
"Age Estimate": 55,
"Ethnicity": "Indian",
"Gender": "Male",
"Sentiment": "Neutral"
}
Video: https://youtu.be/wus7FLhRER4
Colab: https://colab.research.google.com/drive/1E7223DY3RbS-OVM6cZ3VO7MKQxmZ5XZe?usp=sharing
GitHub (VR Version): https://github.com/benamreview/HackARoo-Fall2021-VR-Assets
GitHub (PC Version): https://github.com/benamreview/HackARoo-Fall2021-Assets
PPT Slides (with animations & annotations): https://mailmissouri-my.sharepoint.com/:p:/g/personal/dhh3hb_umsystem_edu/ESgtQ36AxjpGjiy9yIZMPxEBW0nTIMS14dP1CtJY8M9EEA?e=Dresha