Lyrics-free Singing Voice Generation
The conventional approach to generate singing voices is through singing voice synthesis (SVS) techniques. A human user feeds lyrics and MIDI scores (a sequence of notes) to a well-trained SVS model, and the model generates audio recordings following the given lyrics and scores faithfully. The synthesis models have little freedom deciding what to “sing.” In […]
MuseMorphose: Music Style Transfer with A Transformer VAE
At Taiwan AI Labs, we are constantly pushing the frontier of deep music generation models. In the past year, we have rolled out Guitar Transformer (blog), which can compose human-readable guitar tabs with plausible fingerings, and Compound Word Transformer (blog), which vastly accelerated model training and inference thanks to carefully re-engineered music representation. Today, proudly making its debut […]
It is Enough to Take Only One Image: Re-exposure Images by Reconstructing Radiance Maps
Fig. 1 (Left) original image (Middle) result image of adjusting the brightness directly (Right) result image of adjusting the exposure on our reconstructed radiance map. Nowadays, more and more people like to take pictures with smartphones and post on social media to share beautiful photos with their friends. Usually, they do some image editing before […]
360° Depth Estimation
360° videos provide an immersive environment for people to engage, and Taiwan Traveler is a smart online tourism platform that utilizes 360° panoramic views to realize virtual sightseeing experiences. To immerse users in the virtual world, we aim to exploit depth information to provide a sense of space, enabling tourists to better explore scenic attractions. […]
The Magic to Disappear Cameraman: Removing Object from 8K 360° Videos
360° video, also known as immersive video, has been increasingly popular and drawn great attention nowadays since it unlocks unlimited possibilities for content creators and encourages viewer engagement. One representative application which exploits the power of 360° videos is Taiwan Traveler. Taiwan Traveler, Taiwan’s first smart tourism platform developed by Taiwan AI Labs, aims to […]
Azoospermia with Deep Learning Object Detection
Introduce Azoospermia Azoospermia is a medical term implying the condition of no measurable sperm in a man’s semen. It is also the main challenge in male infertility. Azoospermia could be divided into two classes, including obstructive azoospermia(OA) and non-obstructive azoospermia(NOA). For OA, the testicular size and serum hormone profile are normal. On the other hand, […]
Compound Word Transformer: Generate Pop Piano Music of Full-Song Length
Over the past months, we attempted to let transformer models learn to generate full-song music, and here is our first attempt towards that, the Compound Word Transformer. A paper describing this work is going to be published as a full paper at AAAI 2021, the premier conference in the field of artificial intelligence. You can […]
Label360: An Annotation Interface for Labeling Instance-Aware Semantic Labels on Panoramic Full Images
We developed an annotation tool—Label360 to solve the distortion and instance matching issues across different viewing aspects in
spherical image annotations. A post-processing algorithm was introduced to generate distortion-free annotations on equirectangular
images.
DockCoV2: a drug database against SARS-CoV-2
The current state of the COVID-19 pandemic is a global health crisis. From December 2019 to September 2020, SARS-CoV-2 has infected over 32 million people, and caused more than one million deaths worldwide. To fight the novel coronavirus, one of the best-known ways is to block enzymes essential for virus entry or replication. The Genomics […]
Guitar Transformer and Jazz Transformer
At the Yating Music Team of the Taiwan AI Labs, we are developing new music composing AI models extending from our previous Pop Music Transformer model (see the previous blog). In October 2020, we are going to present two full papers documenting some of our latest result at the International Society for Music Information Retrieval […]