Skip to content
View yiyexy's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report yiyexy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
yiyexy/README.md

Hi there, I'm Yin Xie πŸ‘‹ Email

πŸš€ About Me

I'm a Deep Learning Algorithm Engineer specializing in cutting-edge AI technologies. My passion lies in pushing the boundaries of computer vision and multimodal AI systems.

πŸ”¬ Research Interests

  • πŸ–ΌοΈ Computer Vision - Advanced visual understanding and perception
  • πŸ€– Vision-Language Models - Large-scale multimodal AI systems
  • ⚑ Model Optimization - Compression, acceleration, and efficient deployment
  • 🌐 Distributed Training - Scalable deep learning infrastructure

πŸ’‘ Current Focus

My recent work centers on:

  • Visual representation learning and self-supervised techniques
  • End-to-end facial feature pretraining systems
  • Advanced pretraining strategies for vision-language models
  • Publishing research in top-tier AI conferences
  • Contributing to impactful open-source projects

πŸ’¬ Let's Connect!

I'm always open to:

  • 🀝 Collaborating on innovative AI projects
  • πŸ’‘ Discussing cutting-edge research ideas
  • πŸ“š Sharing knowledge and best practices
  • 🌟 Contributing to open-source initiatives

Feel free to reach out via email or connect with me here on GitHub!

Pinned Loading

  1. deepglint/Victor deepglint/Victor Public

    ViCToR: Improving Visual Comprehension via Token Reconstruction for Pretraining LMMs

    Python 28 1

  2. deepglint/unicom deepglint/unicom Public

    Large-Scale Visual Representation Model

    Python 703 33

  3. VLM-review VLM-review Public

    11

  4. deepglint/MVT deepglint/MVT Public

    Margin-based Vision Transformer

    64 2

  5. EvolvingLMMs-Lab/LLaVA-OneVision-1.5 EvolvingLMMs-Lab/LLaVA-OneVision-1.5 Public

    Fully Open Framework for Democratized Multimodal Training

    Python 714 57

  6. EvolvingLMMs-Lab/OneVision-Encoder EvolvingLMMs-Lab/OneVision-Encoder Public

    The first HEVC style Vision Transformer with advanced multimodal capabilities

    Python 82 3