Alternatives to Jupyter Notebook for Python and more!

Jupyter emerged and gained respect as an easy-to-install tool that makes it simpler to write code and visualize its results, a form of interactive computing focused on usability. With the success of Python, other tools on the market started to support the language and compete directly with Jupyter.

#Now that I have spoken well of Jupyter, let’s move on to the competition… :)


Photo by Nicole Wolf on Unsplash

Boost your JupyterLab with these tips!

This post lists extensions that make the JupyterLab IDE easier to use. Here are the tips:

# Variable Inspector

This extension shows the variables used and their values.
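
To give an idea of the kind of table such an inspector displays, here is a minimal sketch in plain Python. It is not the extension's implementation; the function name `inspect_variables` and the example namespace are hypothetical, chosen only for illustration.

```python
import types


def inspect_variables(namespace):
    """Return (name, type name, value repr) rows for user-defined variables.

    Hypothetical helper for illustration only; it is not part of JupyterLab
    or of the Variable Inspector extension.
    """
    rows = []
    for name, value in sorted(namespace.items()):
        # Skip private names, imported modules, and functions/classes,
        # so that only plain data variables remain.
        if name.startswith("_"):
            continue
        if isinstance(value, types.ModuleType) or callable(value):
            continue
        rows.append((name, type(value).__name__, repr(value)))
    return rows


# Example namespace standing in for a notebook session (assumed values).
example_namespace = {"x": 42, "label": "demo", "_hidden": 1, "values": [1, 2, 3]}
for name, type_name, value in inspect_variables(example_namespace):
    print(f"{name:<8} {type_name:<6} {value}")
```

In a notebook, passing `globals()` instead of the example dictionary would list the variables of the current session.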


Photo by Perchek Industrie on Unsplash

Improving the performance of Apache Cassandra, Best practices, and a little more! :)

Cassandra is an open-source NoSQL database designed for rapid scalability and high data availability, maintained mainly by the Apache Software Foundation and its community.
Its main features are:

  • “Decentralization”: all nodes have the same functionality.
  • “Resilience”: data is replicated across several nodes, and replication across multiple data centers is also supported.
  • “Scalability”: adding new nodes to the cluster is fast and does not affect the system's performance; there are systems that use Cassandra with thousands of nodes today.
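
As an illustration of replication across data centers, here is a minimal sketch that builds the CQL statement for a keyspace using `NetworkTopologyStrategy`. The helper name `build_keyspace_cql`, the keyspace name, and the data center names `dc1` and `dc2` are assumptions for the example; real data center names must match the ones configured in the cluster.

```python
def build_keyspace_cql(keyspace, replicas_per_datacenter):
    """Build a CREATE KEYSPACE statement that replicates data per data center.

    Hypothetical helper: it only builds the CQL text and does not connect
    to any cluster.
    """
    if not replicas_per_datacenter:
        raise ValueError("at least one data center is required")
    options = ["'class': 'NetworkTopologyStrategy'"]
    for datacenter, replicas in replicas_per_datacenter.items():
        # Each entry sets how many replicas of every row live in that data center.
        options.append(f"'{datacenter}': {int(replicas)}")
    replication = "{" + ", ".join(options) + "}"
    return f"CREATE KEYSPACE IF NOT EXISTS {keyspace} WITH replication = {replication};"


# Assumed names: keyspace "demo_keyspace", data centers "dc1" and "dc2".
print(build_keyspace_cql("demo_keyspace", {"dc1": 3, "dc2": 2}))
```

The resulting statement can be run in `cqlsh` or through a driver session; the replica counts shown are only an example.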

According to Apache, we can look at Cassandra as being:

“The Apache Cassandra database is the right choice when you need…


Photo by Perchek Industrie on Unsplash

Improving the performance of Apache Cassandra, Best practices, and a little more! :)

Cassandra is an open-source NoSQL database designed for rapid scalability and high data availability, maintained mainly by the Apache Foundation and its community.

Its main features are:

  • “Decentralization”: all nodes have the same functionality.
  • “Resilience”: data is replicated across several nodes, and replication across multiple data centers is also supported.
  • “Scalability”: adding new nodes to the cluster is fast and does not affect the system's performance; there are systems that use Cassandra with thousands of nodes today.

According to Apache, we can look at Cassandra as being:

“The Apache Cassandra database is…


Photo by Fotis Fotopoulos on Unsplash

RStudio in the Docker container environment!!

Docker and containers

More and more, we need to create isolated, resilient environments capable of supporting anything from small applications to distributed databases, while managing infrastructure and network resources intelligently and scaling solutions horizontally as each scenario requires.

Containers appeared as a solution to this dilemma. According to Microsoft, they are “similar to virtual machines, but they do not create an entire virtual operating system. Instead, Docker allows the application to use the same Linux kernel as the system on which it is running.”

Docker is one of the main…


Photo by Possessed Photography on Unsplash

How to comply with LGPD using MLOps, Data catalog, and more?

We are adapting to the Brazilian General Data Protection Law (LGPD). The main objective of this new law is to guarantee data privacy and reliability, but how is the data field adapting to this new reality? What strategies are being adopted? What about Artificial Intelligence (AI)?

These are some of the strategies being adopted in the data field:

  • Infinite Forms?
    This strategy aims to create one or more forms to track who accesses the data and where the data and its sources are located. …

Photo by Boba Jaglicic on Unsplash

Discover Apache Hive, its power, and more!! :)

#Big data

More and more, we have to deal with large Volumes of data that are created and must be used at unbelievable Velocity, with a huge Variety that is almost impossible for a human being to follow, while caring about their Veracity and still adding Value to the business effectively (the 5 Vs of Big data).

To deal with this, the term “Big data” came up, along with several solutions that address these problems in different scenarios, such as Apache Hive.

#Hive

According to IBM, “Apache Hive is open source data warehouse…


Photo by Kelly Sikkema on Unsplash

Deploying and running Spark jobs in Scala on GCP is easy!!

This is a simple tutorial with examples of using Google Cloud to easily run Spark jobs written in Scala! :)

1. Install the Java 8 JDK or Java 11 JDK

To check if Java is installed on your operating system, use the command below:

java -version

#Depending on the Java version, this command may change… :)

If Java is not installed yet, install it from one of these links: Oracle Java 8, Oracle Java 11, or AdoptOpenJDK 8/11. Always check the compatibility between the JDK and Scala versions by following the guidelines at this link: JDK Compatibility.
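
The check above can also be scripted. The following is a minimal sketch, assuming the usual `java -version` output format (for example `1.8.0_292` for Java 8 or `11.0.11` for Java 11); the function names are hypothetical, and the compatibility with Scala still has to be confirmed against the JDK Compatibility page.

```python
import re
import shutil
import subprocess


def parse_java_major(version_output):
    """Extract the major Java version from `java -version` output.

    Returns None when no version string is found.
    """
    match = re.search(r'version "(\d+)(?:\.(\d+))?', version_output)
    if match is None:
        return None
    major = int(match.group(1))
    # Java 8 and earlier report versions such as "1.8.0_292",
    # while newer releases report "11.0.11".
    if major == 1 and match.group(2) is not None:
        return int(match.group(2))
    return major


def installed_java_major():
    """Return the major version of the `java` found on PATH, or None."""
    if shutil.which("java") is None:
        return None
    # `java -version` writes its report to stderr, so both streams are read.
    result = subprocess.run(["java", "-version"], capture_output=True, text=True)
    return parse_java_major(result.stderr + result.stdout)


if __name__ == "__main__":
    major = installed_java_major()
    if major in (8, 11):
        print(f"Java {major} found")
    else:
        print(f"Java 8 or 11 not found (detected: {major})")
```

Keeping the parsing separate from the command call makes it possible to check the logic without having Java installed.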

2. Install Scala

Ubuntu:


Photo by Marc Mintel on Unsplash

DevOps guide: cycle, tools, free certification, and more… :)

According to AWS, we can understand DevOps as “a combination of cultural philosophies, practices, and tools that increase a company’s ability to distribute applications and services at high speed: optimizing and improving products at a faster rate than companies using traditional software development and infrastructure management processes. This speed allows companies to better serve their customers and be able to compete more effectively in the market.”

# Infrastructure as code

This is a practice in which we programmatically provision and configure infrastructure, using software engineering techniques such as version control and continuous integration.

From automation…


Photo by André François McKenzie on Unsplash

Analyzing the price of Bitcoin in Python using fbprophet?

Warning: This publication is not a recommendation to invest or not to invest in bitcoins. If you want to know more about it, please look for a duly certified specialist with experience in the subject!

Motivation: I was studying time series and, seeing how easy the Python time series library fbprophet is to use, I decided to apply it to a data set. Via LinkedIn I received the news that the virtual currency bitcoin had reached a value of 100 thousand, which led me to wonder: using fbprophet, what would the value of that currency be in a year?

Josue Luzardo Gebrim

Here I occasionally share my opinions and what little I know.
